Authors: Sarbani Palit (1) and Payel Sadhukhan (2)

Affiliations: (1) Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India; (2) Institute for Advancing Intelligence, TCG CREST, Kolkata, India
Keyword(s):
Multi-Label, Natural Nearest Neighborhood, Class Imbalance, Undersampling.
Abstract:
This work presents a novel undersampling scheme, NaNUML, to tackle the class-imbalance problem in multi-label datasets. The scheme performs label-specific undersampling based on the natural nearest neighborhood, a parameter-free principle; its novelty lies in exploiting this principle so that the key factor, the neighborhood size 'k', is determined without any parameter optimization. Class imbalance is particularly challenging in a multi-label context, as the imbalance ratio and the majority-minority distributions vary from label to label; consequently, the majority-minority class overlaps also vary across labels. Addressing this, we propose a framework in which a single natural neighbor search suffices to identify all the label-specific overlaps. The natural neighbor information is also used to find the key lattices of the majority class, which we do not undersample. An empirical study involving twelve real-world multi-label datasets, seven competing methods, and four evaluation metrics shows that NaNUML mitigates the class-imbalance issue in multi-label datasets to a considerable extent, and in several cases achieves statistically superior performance over the competing methods.
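The parameter-free natural neighbor search referred to in the abstract can be illustrated with a minimal sketch. This is our own illustration of the general natural-neighbor principle (grow the neighborhood size round by round until the count of points that nobody selects as a neighbor stabilizes), not the authors' NaNUML implementation; the function name and the exact stopping rule are assumptions.

```python
import numpy as np

def natural_neighbor_search(X):
    """Sketch of a parameter-free natural neighbor search.

    Round r lets every point 'vote' for its r-th nearest neighbour.
    The search stops when the number of points that have received no
    vote (no reverse neighbour) stops changing between rounds; the
    terminal round is taken as the natural neighborhood size k.
    """
    n = X.shape[0]
    # Pairwise Euclidean distances, sorted once up front.
    d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    order = np.argsort(d, axis=1)[:, 1:]  # drop each point's self
    reverse_count = np.zeros(n, dtype=int)  # votes received so far
    prev_orphans = -1
    for r in range(1, n):
        # Each point votes for its r-th nearest neighbour this round.
        for i in range(n):
            reverse_count[order[i, r - 1]] += 1
        orphans = int(np.sum(reverse_count == 0))
        if orphans == prev_orphans:
            # Orphan count stabilised: natural neighborhood size found.
            return r, reverse_count
        prev_orphans = orphans
    return n - 1, reverse_count
```

The returned `reverse_count` (how often each point is chosen as a neighbor) is the kind of per-point information a scheme like NaNUML could reuse across all labels after a single search, e.g. to locate majority-minority overlaps and densely connected "key" majority points to protect from undersampling.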