Authors: Camelia Vidrighin Bratu and Rodica Potolea

Affiliation: Technical University of Cluj-Napoca, Romania

Keyword(s): Preprocessing, Unified Methodology, Feature Selection, Data Imputation.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Artificial Intelligence and Decision Support Systems ; Biomedical Engineering ; Biomedical Signal Processing ; Business Analytics ; Computational Intelligence ; Data Engineering ; Data Mining ; Databases and Information Systems Integration ; Datamining ; Enterprise Information Systems ; Health Engineering and Technology Applications ; Health Information Systems ; Human-Computer Interaction ; Methodologies and Methods ; Neural Network Software and Applications ; Neural Networks ; Neurocomputing ; Neurotechnology, Electronics and Informatics ; Pattern Recognition ; Physiological Computing Systems ; Sensor Networks ; Signal Processing ; Soft Computing ; Theory and Methods

Abstract: Data-related issues are the main obstacle to obtaining a high-quality data mining process. Existing strategies for preprocessing the available data usually focus on a single aspect, such as incompleteness, dimensionality, or filtering out “harmful” attributes. In this paper we propose a unified methodology for data preprocessing which considers several aspects at the same time. The novelty of the approach lies in enhancing the data imputation step with information from the feature selection step and performing both operations jointly, as two phases of the same activity. The methodology performs data imputation only on the attributes which are optimal for the class (from the feature selection point of view). Imputation is performed using machine learning methods. When imputing values for a given attribute, the optimal subset of features for that attribute is considered. The methodology is not restricted to a particular technique, but can be applied using any existing data imputation and feature selection methods.
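
The joint procedure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: since the abstract leaves both techniques open, a Pearson-correlation filter stands in for feature selection and a nearest-neighbour mean stands in for the machine learning imputer. The function names and the parameters `k_class`, `k_attr`, and `n_neighbors` are hypothetical, and drawing the predictors for each attribute from the non-class attributes only is an assumption.

```python
import math


def pearson(xs, ys):
    """Pearson correlation over the rows where both values are present."""
    pairs = [(x, y) for x, y in zip(xs, ys) if x is not None and y is not None]
    if len(pairs) < 2:
        return 0.0
    mx = sum(x for x, _ in pairs) / len(pairs)
    my = sum(y for _, y in pairs) / len(pairs)
    cov = sum((x - mx) * (y - my) for x, y in pairs)
    vx = sum((x - mx) ** 2 for x, _ in pairs)
    vy = sum((y - my) ** 2 for _, y in pairs)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / math.sqrt(vx * vy)


def select_features(columns, target, candidates, k):
    """Stand-in feature selection: the k candidates most correlated with target."""
    ranked = sorted(
        candidates,
        key=lambda c: abs(pearson(columns[c], columns[target])),
        reverse=True,
    )
    return ranked[:k]


def impute_attribute(columns, attr, predictors, n_neighbors=2):
    """Stand-in imputer: mean of attr over the nearest complete rows,
    with distance measured on the predictor attributes only."""
    n = len(columns[attr])
    donors = [
        i for i in range(n)
        if columns[attr][i] is not None
        and all(columns[p][i] is not None for p in predictors)
    ]
    filled = list(columns[attr])
    for i in range(n):
        if filled[i] is not None:
            continue
        if not donors or any(columns[p][i] is None for p in predictors):
            continue  # not enough information; the value stays missing
        nearest = sorted(
            donors,
            key=lambda j: sum((columns[p][i] - columns[p][j]) ** 2 for p in predictors),
        )[:n_neighbors]
        filled[i] = sum(columns[attr][j] for j in nearest) / len(nearest)
    return filled


def preprocess(columns, class_name, k_class, k_attr):
    """Phase 1: select the attributes relevant to the class.
    Phase 2: impute only those attributes, each from its own selected subset."""
    features = [c for c in columns if c != class_name]
    selected = select_features(columns, class_name, features, k_class)
    result = {class_name: list(columns[class_name])}
    for attr in selected:
        others = [c for c in features if c != attr]
        predictors = select_features(columns, attr, others, k_attr)
        result[attr] = impute_attribute(columns, attr, predictors)
    return result
```

Calling `preprocess(columns, "cls", k_class=2, k_attr=1)` on a dictionary that maps attribute names to equal-length lists (with `None` marking missing values) returns only the class column and the selected attributes, with gaps filled where donors exist. Attributes that are not selected for the class are never imputed, which is the point of performing the two steps jointly.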

CC BY-NC-ND 4.0

Paper citation in several formats:
Vidrighin Bratu, C. and Potolea, R. (2009). TOWARDS A UNIFIED STRATEGY FOR THE PREPROCESSING STEP IN DATA MINING. In Proceedings of the 11th International Conference on Enterprise Information Systems - Volume 2: ICEIS; ISBN 978-989-8111-85-2; ISSN 2184-4992, SciTePress, pages 230-235. DOI: 10.5220/0002008902300235

@conference{iceis09,
author={Camelia {Vidrighin Bratu} and Rodica Potolea},
title={TOWARDS A UNIFIED STRATEGY FOR THE PREPROCESSING STEP IN DATA MINING},
booktitle={Proceedings of the 11th International Conference on Enterprise Information Systems - Volume 2: ICEIS},
year={2009},
pages={230-235},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002008902300235},
isbn={978-989-8111-85-2},
issn={2184-4992},
}

TY - CONF

JO - Proceedings of the 11th International Conference on Enterprise Information Systems - Volume 2: ICEIS
TI - TOWARDS A UNIFIED STRATEGY FOR THE PREPROCESSING STEP IN DATA MINING
SN - 978-989-8111-85-2
IS - 2184-4992
AU - Vidrighin Bratu, C.
AU - Potolea, R.
PY - 2009
SP - 230
EP - 235
DO - 10.5220/0002008902300235
PB - SciTePress