loading
Papers

Research.Publish.Connect.

Paper

Paper Unlock

Author: Julia Bondarenko

Affiliation: Helmut Schmidt University Hamburg (University of the Federal Armed Forces Hamburg), Germany

ISBN: 978-989-8111-99-9

Keyword(s): Resampling, Classification algorithm C4.5, Uniform/(truncated) Normal distribution, Kurtosis, Chi-squared test, Kolmogorov-Smirnov test, Traffic injuries number.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence and Decision Support Systems ; Enterprise Information Systems ; Informatics in Control, Automation and Robotics ; Intelligent Control Systems and Optimization ; Knowledge-Based Systems Applications ; Machine Learning in Control Applications

Abstract: In imbalanced data sets, classes separated into majority (negative) and minority (positive) classes, are not approximately equally represented. That leads to impeding of accurate classification results. Well balanced data sets assume uniform distribution. The approach we present in the paper, is based on directed oversampling of minority class objects with simultaneous undersampling of majority class objects, to balance non-uniform data sets, and relies upon the certain statistical criteria. The resampling procedure is carried out for the daily traffic injuries data sets. The results obtained show the improving of rare cases (positive class objects) identification with accordance to several performance measures.

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.81.29.226

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Bondarenko J. and (2009). RESAMPLING BASED ON STATISTICAL PROPERTIES OF DATA SETS.In Proceedings of the 6th International Conference on Informatics in Control, Automation and Robotics - Volume 3: ICINCO, ISBN 978-989-8111-99-9, pages 143-148. DOI: 10.5220/0002171701430148

@conference{icinco09,
author={Julia Bondarenko},
title={RESAMPLING BASED ON STATISTICAL PROPERTIES OF DATA SETS},
booktitle={Proceedings of the 6th International Conference on Informatics in Control, Automation and Robotics - Volume 3: ICINCO,},
year={2009},
pages={143-148},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002171701430148},
isbn={978-989-8111-99-9},
}

TY - CONF

JO - Proceedings of the 6th International Conference on Informatics in Control, Automation and Robotics - Volume 3: ICINCO,
TI - RESAMPLING BASED ON STATISTICAL PROPERTIES OF DATA SETS
SN - 978-989-8111-99-9
AU - Bondarenko, J.
PY - 2009
SP - 143
EP - 148
DO - 10.5220/0002171701430148

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.