SMOTE: Are We Learning to Classify or to Detect Synthetic Data?

Nada Boudegzdame; Karima Sedki; Rosy Tspora; Jean-Baptiste Lamy

Topics: AI and Creativity; Data Science; Machine Learning; Neural Networks

In Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART, pages 283-290, 2024, Rome, Italy

Authors: Nada Boudegzdame ¹ ; Karima Sedki ¹ ; Rosy Tspora ²,³,⁴ and Jean-Baptiste Lamy ¹

Affiliations: ¹ LIMICS, INSERM, Université Sorbonne Paris Nord, Sorbonne Université, France ; ² INSERM, Université de Paris Cité, Sorbonne Université, Cordeliers Research Center, France ; ³ HeKA, INRIA, France ; ⁴ Department of Medical Informatics, Hôpital Européen Georges-Pompidou, AP-HP, France

Keyword(s): Imbalanced Data, Oversampling, SMOTE, Data Augmentation, Class Imbalance, Machine Learning, Neural Networks, Synthetic Data.

Abstract: Oversampling algorithms are used as a preprocessing step in machine learning on highly imbalanced data, in an attempt to balance the number of samples per class and thereby improve the quality of the learned models. While oversampling can improve the performance of classification models on minority classes, it can also introduce several problems. Our work shows that the models learn to detect the noise added by the oversampling algorithms instead of the underlying patterns. In this article, we define oversampling and present the most common techniques, before proposing a method for evaluating oversampling algorithms.
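
The abstract does not spell out how an oversampler creates new samples, so the following is only a minimal sketch of the general SMOTE idea: a synthetic minority point is placed on the segment between a minority sample and one of its nearest minority neighbours. The function name `smote_like_oversample`, its parameters, and the toy data are hypothetical and chosen for illustration. This is not the authors' implementation or their proposed evaluation method.

```python
import random


def smote_like_oversample(minority, n_new, k=3, seed=0):
    """Return n_new synthetic points interpolated between minority samples.

    Hypothetical helper: a simplified SMOTE-style interpolation, assuming
    numeric features and at least two minority samples.
    """
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority samples by squared Euclidean distance to base.
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append(
            tuple(a + gap * (b - a) for a, b in zip(base, neighbour))
        )
    return synthetic


# Toy data (assumed): 4 minority samples to be balanced against 10 majority samples.
minority = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
majority_count = 10
new_points = smote_like_oversample(minority, majority_count - len(minority))
balanced_minority = minority + new_points
```

Every synthetic point in this sketch lies on a segment between two real minority samples. That geometric regularity is one kind of artefact a model could learn to recognise instead of the underlying class pattern, which is the concern the abstract raises.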

CC BY-NC-ND 4.0

Paper citation in several formats:

Boudegzdame, N., Sedki, K., Tspora, R. and Lamy, J.-B. (2024). SMOTE: Are We Learning to Classify or to Detect Synthetic Data? In Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART; ISBN 978-989-758-680-4; ISSN 2184-433X, SciTePress, pages 283-290. DOI: 10.5220/0012325300003636

@conference{icaart24,
author={Nada Boudegzdame and Karima Sedki and Rosy Tspora and Jean{-}Baptiste Lamy},
title={SMOTE: Are We Learning to Classify or to Detect Synthetic Data?},
booktitle={Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART},
year={2024},
pages={283-290},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012325300003636},
isbn={978-989-758-680-4},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART
TI - SMOTE: Are We Learning to Classify or to Detect Synthetic Data?
SN - 978-989-758-680-4
IS - 2184-433X
AU - Boudegzdame, N.
AU - Sedki, K.
AU - Tspora, R.
AU - Lamy, J.-B.
PY - 2024
SP - 283
EP - 290
DO - 10.5220/0012325300003636
PB - SciTePress