loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Jesús Mª Pérez ; Javier Muguerza ; Olatz Arbelaitz ; Ibai Gurrutxaga and Jose I. Martin

Affiliation: University of the Basque Country, Spain

Abstract: Many machine learning areas use subsampling techniques with different objectives: reducing the size of the training set, equilibrate the class imbalance or non-uniform cost error, etc. Subsampling affects severely to the behavior of classification algorithms. Decision trees induced from different subsamples of the same data set are very different in accuracy and structure. This affects the explanation of the classification; very important in some domains. This paper presents a new methodology for building decision trees. The final classifier is a single decision tree, so that it maintains the explaining capacity of the classification. A comparison in error and structural stability of our algorithm and the C4.5 algorithm is done. The decision trees generated using the new algorithm, achieve smaller error rates and structurally more steady trees than C4.5 when using subsampling techniques.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.117.81.240

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Mª Pérez, J.; Muguerza, J.; Arbelaitz, O.; Gurrutxaga, I. and I. Martin, J. (2004). Behavior of Consolidated Trees when using Resampling Techniques. In Proceedings of the 4th International Workshop on Pattern Recognition in Information Systems (ICEIS 2004) - PRIS; ISBN 972-8865-01-5, SciTePress, pages 139-148. DOI: 10.5220/0002665601390148

@conference{pris04,
author={Jesús {Mª Pérez}. and Javier Muguerza. and Olatz Arbelaitz. and Ibai Gurrutxaga. and Jose {I. Martin}.},
title={Behavior of Consolidated Trees when using Resampling Techniques},
booktitle={Proceedings of the 4th International Workshop on Pattern Recognition in Information Systems (ICEIS 2004) - PRIS},
year={2004},
pages={139-148},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002665601390148},
isbn={972-8865-01-5},
}

TY - CONF

JO - Proceedings of the 4th International Workshop on Pattern Recognition in Information Systems (ICEIS 2004) - PRIS
TI - Behavior of Consolidated Trees when using Resampling Techniques
SN - 972-8865-01-5
AU - Mª Pérez, J.
AU - Muguerza, J.
AU - Arbelaitz, O.
AU - Gurrutxaga, I.
AU - I. Martin, J.
PY - 2004
SP - 139
EP - 148
DO - 10.5220/0002665601390148
PB - SciTePress