loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Daniel Rocha Franca 1 ; Caio Davi Rabelo Fiorini 2 ; Ligia Ferreira de Carvalho Gonçalves 2 ; Marta Dias Moreira Noronha 3 ; Mark Song 3 and Luis Enrique Zárate Galvez 3

Affiliations: 1 Bach. Computer Science, Pontifícia Universidade Católica de Minas Gerais, Rua Claudio Manuel, Belo Horizonte, Brazil ; 2 Bach. Data Science and Artificial Intelligence, Pontifícia Universidade Católica de Minas Gerais, Rua Claudio Manuel, Belo Horizonte, Brazil ; 3 Institute of Exact Sciences and Computer Science, Pontifícia Universidade Católica de Minas Gerais, Rua Claudio Manuel, Belo Horizonte, Brazil

Keyword(s): Hypercholesterolemia, Young Population, Machine Learning, Decision Tree, Genetic Algorithm, Data Mining, National Health Survey, Risk Factors, Data Preprocessing, Health Informatics, CAPTO.

Abstract: Understanding the risk factors associated with hypercholesterolemia in young individuals is crucial for developing preventive strategies to combat cardiovascular diseases. This study proposes a data mining pipeline employing machine learning techniques to profile high cholesterol in Brazilian youth aged 15 to 25, utilizing the 2019 National Health Survey (PNS) dataset. The PNS-2019 database has 1,088 attributes organized into 26 modules and 293,726 anonymized records. The Knowledge Discovery in Databases (KDD) process was implemented, incorporating a novel CAPTO-based conceptual attribute selection followed by feature selection using a Non-dominated Sorting Genetic Algorithm II (NSGA-II). A decision tree classifier was optimized and evaluated, achieving an F1 Score of 66%, demonstrating reasonable predictive power despite data limitations. The results highlight the significant impact of dietary habits, particularly high sugar and fat intake, on hyper-cholesterolemia risk. The study e mphasizes the potential for early identification and targeted interventions, contributing to public health improvements and laying the groundwork for future research with advanced models and additional data sources. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.139.239.16

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Franca, D. R., Fiorini, C. D. R., Gonçalves, L. F. C., Noronha, M. D. M., Song, M. and Galvez, L. E. Z. (2025). A Knowledge Discovery Pipeline to Describe the High Cholesterol Profile in Young People Using GA for Feature Selection. In Proceedings of the 27th International Conference on Enterprise Information Systems - Volume 1: ICEIS; ISBN 978-989-758-749-8; ISSN 2184-4992, SciTePress, pages 805-812. DOI: 10.5220/0013294800003929

@conference{iceis25,
author={Daniel Rocha Franca and Caio Davi Rabelo Fiorini and Ligia Ferreira de Carvalho Gon\c{c}alves and Marta Dias Moreira Noronha and Mark Song and Luis Enrique Zárate Galvez},
title={A Knowledge Discovery Pipeline to Describe the High Cholesterol Profile in Young People Using GA for Feature Selection},
booktitle={Proceedings of the 27th International Conference on Enterprise Information Systems - Volume 1: ICEIS},
year={2025},
pages={805-812},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013294800003929},
isbn={978-989-758-749-8},
issn={2184-4992},
}

TY - CONF

JO - Proceedings of the 27th International Conference on Enterprise Information Systems - Volume 1: ICEIS
TI - A Knowledge Discovery Pipeline to Describe the High Cholesterol Profile in Young People Using GA for Feature Selection
SN - 978-989-758-749-8
IS - 2184-4992
AU - Franca, D.
AU - Fiorini, C.
AU - Gonçalves, L.
AU - Noronha, M.
AU - Song, M.
AU - Galvez, L.
PY - 2025
SP - 805
EP - 812
DO - 10.5220/0013294800003929
PB - SciTePress