Author: Julia Bondarenko

Affiliation: Helmut Schmidt University Hamburg (University of the Federal Armed Forces Hamburg), Germany

Keyword(s): Resampling, Classification algorithm C4.5, Uniform/(truncated) Normal distribution, Kurtosis, Chi-squared test, Kolmogorov-Smirnov test, Traffic injuries number.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence and Decision Support Systems ; Enterprise Information Systems ; Informatics in Control, Automation and Robotics ; Intelligent Control Systems and Optimization ; Knowledge-Based Systems Applications ; Machine Learning in Control Applications

Abstract: In imbalanced data sets, the classes, divided into a majority (negative) class and a minority (positive) class, are not approximately equally represented, which impedes accurate classification. Well-balanced data sets assume a uniform distribution. The approach presented in this paper balances non-uniform data sets by directed oversampling of minority-class objects with simultaneous undersampling of majority-class objects, and relies on certain statistical criteria. The resampling procedure is carried out on daily traffic injury data sets. The results show improved identification of rare cases (positive-class objects) according to several performance measures.
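
The general idea of combining oversampling and undersampling can be illustrated with a minimal sketch. It is not the directed, criteria-based procedure of the paper, whose details the abstract does not give: it assumes plain random resampling, and the function name `rebalance`, the mean-class-size target, and the 90/10 toy labels are all hypothetical choices made for illustration.

```python
import random
from collections import Counter


def rebalance(labels, seed=0):
    """Return indices resampled so that every class has the same count.

    Assumption (not from the paper): the target size per class is the
    mean class size. Larger classes are randomly undersampled without
    replacement; smaller classes are randomly oversampled with replacement.
    """
    rng = random.Random(seed)

    # Group object indices by class label.
    by_class = {}
    for idx, label in enumerate(labels):
        by_class.setdefault(label, []).append(idx)

    target = round(len(labels) / len(by_class))

    chosen = []
    for idxs in by_class.values():
        if len(idxs) >= target:
            # Majority class: keep a random subset of the objects.
            chosen.extend(rng.sample(idxs, target))
        else:
            # Minority class: keep all objects and duplicate random ones.
            extra = rng.choices(idxs, k=target - len(idxs))
            chosen.extend(idxs + extra)
    return chosen


# Toy data: 90 negative (majority) and 10 positive (minority) objects.
labels = [0] * 90 + [1] * 10
idx = rebalance(labels)
counts = Counter(labels[i] for i in idx)
```

After the call, `counts` holds the same number of objects for both classes, so the class distribution of the resampled set is uniform. A directed scheme would replace the random choices with a rule for which objects to duplicate or drop.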

CC BY-NC-ND 4.0

Paper citation in several formats:
Bondarenko, J. (2009). RESAMPLING BASED ON STATISTICAL PROPERTIES OF DATA SETS. In Proceedings of the 6th International Conference on Informatics in Control, Automation and Robotics - Volume 3: ICINCO; ISBN 978-989-8111-99-9; ISSN 2184-2809, SciTePress, pages 143-148. DOI: 10.5220/0002171701430148

@conference{icinco09,
author={Julia Bondarenko},
title={RESAMPLING BASED ON STATISTICAL PROPERTIES OF DATA SETS},
booktitle={Proceedings of the 6th International Conference on Informatics in Control, Automation and Robotics - Volume 3: ICINCO},
year={2009},
pages={143-148},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002171701430148},
isbn={978-989-8111-99-9},
issn={2184-2809},
}

TY - CONF

JO - Proceedings of the 6th International Conference on Informatics in Control, Automation and Robotics - Volume 3: ICINCO
TI - RESAMPLING BASED ON STATISTICAL PROPERTIES OF DATA SETS
SN - 978-989-8111-99-9
IS - 2184-2809
AU - Bondarenko, J.
PY - 2009
SP - 143
EP - 148
DO - 10.5220/0002171701430148
PB - SciTePress