2 METHOD
The method and theory adopted by the prediction is
the model of random forest, which is a commonly used
machine learning algorithm trademarked by Leo
Breiman and Adele Cutler, which combines the output
of multiple decision trees to reach a single result. The
advantage of the ease of use and flexibility of the
model is the main contribution to its adoption, as it
provides solutions for both classification and
regression problems.
3 GENESIS ANALYSIS
To predict the probability of stroke, the contribution
of stroke is necessary to be analyzed.
According to the research, the causes and
contributions of stroke are complicated while most of
them are chronic diseases except for variables such as
gender and age (Wenlong 2018, Rui et al 2021 &
Wang et al 2023).
3.1 Hypertension
Hypertension accounts for 35%~50% of the risk of
having a stroke (Wenlong 2018). Epidemiological
studies have shown that if blood pressure is reduced
by 5~6 mmHg systolic blood pressure, 2~3 mmHg
diastolic blood pressure will also reduce the risk of
stroke by 40% (Rui et al 2021). Studies have further
shown that lowering blood pressure is effective in
preventing both ischemic and hemorrhagic stroke
(Xiaoxia et al 2022). Antihypertensive therapy is
beneficial for both elderly patients over 80 years of
age and isolated systolic hypertension (Yewen 2020).
Recent studies have shown that high-intensity
antihypertensive therapy reduces the risk of stroke
even more (Leal et al 2020).
3.2 Heart Disease
Atrial fibrillation, which is a type of heart disease, is
the most common arrhythmia in clinical practice, with
a 5% risk of induced stroke per year, and an even
higher risk of stroke in patients with valvular atrial
fibrillation (Rowan et al 2019). Atrial fibrillation can
cause poor blood flow, so the blood in the atria will
clot more easily, causing blood clots and blood clots
in the atrium, when the blood clots flow to the brain
with the blood circulation, it will block the cerebral
arteries, resulting in stroke. Studies have shown that
patients with atrial fibrillation are 5 times more likely
to have a stroke than normal people, the recurrence
rate of stroke caused by atrial fibrillation is high, and
the prognosis is poor. Statistics show that 1 in 20
patients with atrial fibrillation will have a stroke
within 1 year (Paray 2023 & Rowan et al 2019).
Therefore, for patients with atrial fibrillation, how to
effectively prevent stroke is a very important issue
(Wang et al 2023).
3.3 High Cholesterol
The relationship between high cholesterol and stroke
remains unclear (Rui et al 2021). However, data from
meta-analysis has shown that statins reduce the risk of
stroke by about 15% (Seo et al 2017). Other lipid-
lowering drugs have little effect on stroke reduction
(Xiaoxia et al 2022). Recent studies show that statins
may reduce stroke through other mechanisms (Rowan
et al 2019).
3.4 Diabetic
Studies have shown that stroke in diabetic patients is
2~3 times higher than that of ordinary people and
diabetes, is often accompanied by hypertension and
lipids. High-intensity diabetes treatment can be
reduced. Anticoagulant efficacy needs to be
suppressed (Rui et al 2021).
4 RESULTS & DISCUSSION
According to previous research on the contribution of
stroke, most of the contributions are chronic diseases,
which is able to be monitored by BMI and blood
glucose. Also, chronic diseases can be closely related
to age and gender (Seo et al 2017). As the prediction
demands a comparison, it is also necessary to form a
figure of the correlation between stroke patients and
non-stroke patients. Therefore, after collecting
meaningful data from Kaggle about the gender, age,
BMI, and blood glucose index of 200 stroke patients,
forming figures to show the relationships between the
index of stroke patients and non-stroke patients can be
the best solution (Rowan et al 2019). The result of data
visualization can be analyzed to be the foundation for
finding out the probability of stroke. The probability
of stroke is stretched at last as the final result.
According to the contributions above. The codes
aim to analyze the relationship between the variances
and stroke and form correlation figures. By analyzing
the correlation figures based on the Random Forrest
model, the prediction can stand at last.