Impact of Threshold Values for Filter-based Univariate Feature Selection in Heart Disease Classification

Houda Benhar, Ali Idri, Ali Idri, Mohamed Hosni

2020

Abstract

In the last decade, feature selection (FS), was one of the most investigated preprocessing tasks for heart disease prediction. Determining the optimal features which contribute more towards the diagnosis of heart disease can reduce the number of clinical tests needed to be taken by a patient, decrease the model cost, reduce the storage requirements and improve the comprehensibility of the induced model. In this study a comparison of three filter feature ranking methods was carried out. Feature ranking methods need to set a threshold (i.e. the percentage of the number of relevant features to be selected) in order to select the final subset of features. Thus, the aim of this study is to investigate if there is a threshold value which is an optimal choice for three different feature ranking methods and four classifiers used for heart disease classification in four heart disease datasets. The used feature ranking methods and selection thresholds resulted in optimal classification performance for one or more classifiers over small and large heart disease datasets. The size of the dataset takes an important role in the choice of the selection threshold.

Download


Paper Citation


in Harvard Style

Benhar H., Idri A. and Hosni M. (2020). Impact of Threshold Values for Filter-based Univariate Feature Selection in Heart Disease Classification. In Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2020) - Volume 5: HEALTHINF; ISBN 978-989-758-398-8, SciTePress, pages 391-398. DOI: 10.5220/0008947403910398


in Bibtex Style

@conference{healthinf20,
author={Houda Benhar and Ali Idri and Mohamed Hosni},
title={Impact of Threshold Values for Filter-based Univariate Feature Selection in Heart Disease Classification},
booktitle={Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2020) - Volume 5: HEALTHINF},
year={2020},
pages={391-398},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008947403910398},
isbn={978-989-758-398-8},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2020) - Volume 5: HEALTHINF
TI - Impact of Threshold Values for Filter-based Univariate Feature Selection in Heart Disease Classification
SN - 978-989-758-398-8
AU - Benhar H.
AU - Idri A.
AU - Hosni M.
PY - 2020
SP - 391
EP - 398
DO - 10.5220/0008947403910398
PB - SciTePress