tools. This would lead to better patient outcomes and
save lives on a global scale. The study's structure
includes sections on related work, dataset description,
feature analysis, correlation tests, experimental
results, and implications.
2 LITERATURE REVIEW
Cardiovascular disease (CVD) accounts for
significant, worsening morbidity and mortality
worldwide, and the development of predictive models
is highly essential. As the increased demand for
automatic diagnostic structures in hospital therapy
continues, progress in information technology has
specifically played a crucial role in the diagnosis of
CVD (Azmi, et al. , 2022). Subramani et al conducted
a study on machine learning (ML) and deep learning
models to Cardiovascular disease prediction. Dataset
used herein contains samples of 918 after removing
Duplicates that have data from sources like
Cleveland, Hungarian, etc. The results indicate
classification methods, such as Random Forest (RF),
Logistic Regression (LR),Multi- Layer Perceptron
(MLP), and CatBoost. A stacking model achieved the
best accuracy of nearly 96%, utilizing Gradient
Boosting Decision Trees (GBDT) and SHAP for
feature selection. The model’s performance was
evaluated through metrics such as accuracy,
precision, recall, and AUC (Subramani, et al. ,
2023).For instance, Pasha et al. published a work on
deep learning for cardiovascular disease
prediction. techniques. They took the Kaggle dataset,
which contains information like age, gender, blood
Pressure, and Cholesterol Different machine learning
algorithms were tried. These include Support Vector
Machines (SVM), K-Nearest Neighbor (KNN), and
Decision Trees (DT). The results revealed that These
algorithms did not perform well on large data sets.
Thus, the authors implemented an The Artificial
Neural Network, through TensorFlow Keras,
improved the degree of prediction up to 85.24%,
surpassing the other models (Pal, et al. , 2022).
Pal et al. did a study on the prediction of
cardiovascular disease (CVD) using machine
learning (ML).The study utilized the UCI repository
data set and concentrated only on 13 key attributes.
Two ML K-Nearest Neighbor (K-NN) and Multi-
Layer Perceptron (MLP) models were employed. The
MLP the model succeeded in achieving a higher
accuracy at 82.47% with an AUC of 86.41% than the
K-NN model. with the accuracy of 73.77%. In
conclusion, the MLP model was proven to be more
effective on which inputs. automatic CVD detection,
showing improved performance over all key metrics
(Ali, et al., 2021).This is an article, titled "Heart
disease prediction using supervised machine learning
algorithms:" Performance analysis and comparison,
discusses the prospect of supervised machine. some
learning algorithms for forecasting cardiovascular
diseases (CVD). According to data from Kaggle, the
The study applied multiple classifiers, including K-
Nearest Neighbors (KNN), Decision Tree (DT), and
Random Forest (RF). The results show that RF, KNN,
and DT are perfect accuracy algorithms. sensitivity,
and specificity, making them highly effective for
CVD prediction. The study highlights the
Importantly, feature selection allows for the
enhancement of prediction accuracy by
demonstrating that Use of Machine learning in
clinical decision-making about heart disease
diagnosis (Shah, et al. , 2020).
Shah et al. tested a data set of 303 instances with
14 attributes that were chosen from the Cleveland
database for heart disease prediction. The four
machine learning algorithms Naïve Bayes, Decision
Tree, K-Nearest Neighbor (KNN), and Random
Forest. KNN achieved the highest accuracy at
90.79%, followed by Naïve Bayes at 88.16%,
Random Forest at 86.84%, and Decision Tree at
80.26%. The author concluded that KNN, Naïve
Bayes and Random Forest are effective in predicting
heart disease, with KNN delivering the best
performance (Krittanawong, et al. , 2020). This meta-
analysis published, assessed the predictive capability
of machine learning algorithms in the prediction of
cardiovascular disease on more than 3.3 million
patients across 103 cohorts. Custom-built algorithms
with boosting techniques performed reasonably for
the prediction of coronary artery disease and reported
an AUC of up to 0.93. The SVM and CNN algorithms
worked best to predict stroke, with an AUC of up to
0.92. However, there is still a big variety between
different algorithms, indicating the choice made
should be carefully evaluated clinically.The article
named "Cardiovascular Disease Risk Prediction with
Supervised Machine Learning Techniques" and
addresses the long-term risk of CVD using machine
learning models to predict such risk. Finally,
comparisons of some supervised learning algorithms
are presented; namely Logistic Regression, Support
Vector Machines (SVM), Naive Bayes, and Random
Forest algorithms, with respect to three measures of
accuracy, recall, and AUC metrics. It is realized that
the highest accuracy yielded by the Logistic
Regression model is around 72.1% with AUC being
at 78.4%, suggesting the best model for predicting the
risk for CVD (Pasha, Ramesh, et al. , 2020).In the