Authors: Nada Boudegzdame 1 ; Karima Sedki 1 ; Rosy Tspora 2, 3, 4 and Jean-Baptiste Lamy 1

Affiliations: 1 LIMICS, INSERM, Université Sorbonne Paris Nord, Sorbonne Université, France ; 2 INSERM, Université de Paris Cité, Sorbonne Université, Cordeliers Research Center, France ; 3 HeKA, INRIA, France ; 4 Department of Medical Informatics, Hôpital Européen Georges-Pompidou, AP-HP, France

Keyword(s): Imbalanced Data, Oversampling, SMOTE, Data Augmentation, Class Imbalance, Machine Learning, Neural Networks, Synthetic Data.

Abstract: Oversampling algorithms are used as a preprocessing step in machine learning when data are highly imbalanced, in an attempt to balance the number of samples per class and thereby improve the quality of the learned models. While oversampling can improve the performance of classification models on minority classes, it can also introduce several problems. Our work revealed that models learn to detect the noise added by the oversampling algorithms instead of the underlying patterns. In this article, we define oversampling and present the most common techniques before proposing a method for evaluating oversampling algorithms.
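
The abstract names SMOTE but does not describe how it generates samples. As a rough illustration only, the sketch below assumes the commonly described SMOTE idea: pick a minority sample, pick one of its nearest minority neighbours, and create a new point somewhere on the segment between them. The function name `smote_like_oversample`, the parameter `k`, and the toy data are hypothetical and do not come from the paper; this is not the authors' implementation or evaluation method.

```python
import math
import random


def smote_like_oversample(minority, n_new, k=2, seed=0):
    """Create n_new synthetic points by interpolating between minority neighbours.

    Illustrative sketch of SMOTE-style interpolation, not a reference implementation.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbours of the base point (the base itself is excluded)
        others = [p for p in minority if p is not base]
        neighbours = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> neighbour, in [0, 1)
        synthetic.append(tuple(b + gap * (n - b) for b, n in zip(base, neighbour)))
    return synthetic


# Toy data (assumed for illustration): 4 minority samples against 10 majority samples.
minority = [(1.0, 1.0), (1.2, 0.9), (0.8, 1.1), (1.1, 1.3)]
majority_count = 10
new_points = smote_like_oversample(minority, majority_count - len(minority))
balanced_minority = minority + new_points
```

Every synthetic point here lies on a segment between two real minority points, so the generated data carries a geometric regularity of its own. That is one plausible way a model could end up keying on artifacts of the generator rather than on the underlying patterns, which is the concern the abstract raises.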

CC BY-NC-ND 4.0

Paper citation in several formats:
Boudegzdame, N.; Sedki, K.; Tspora, R. and Lamy, J. (2024). SMOTE: Are We Learning to Classify or to Detect Synthetic Data?. In Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART; ISBN 978-989-758-680-4; ISSN 2184-433X, SciTePress, pages 283-290. DOI: 10.5220/0012325300003636

@conference{icaart24,
author={Nada Boudegzdame and Karima Sedki and Rosy Tspora and Jean{-}Baptiste Lamy},
title={SMOTE: Are We Learning to Classify or to Detect Synthetic Data?},
booktitle={Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART},
year={2024},
pages={283--290},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012325300003636},
isbn={978-989-758-680-4},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART
TI - SMOTE: Are We Learning to Classify or to Detect Synthetic Data?
SN - 978-989-758-680-4
IS - 2184-433X
AU - Boudegzdame, N.
AU - Sedki, K.
AU - Tspora, R.
AU - Lamy, J.
PY - 2024
SP - 283
EP - 290
DO - 10.5220/0012325300003636
PB - SciTePress