prevalence rate of diabetes in China at the same
time. It is speculated that most residents in this
community have exercise habits. Therefore, the
influencing factors finally selected for this study are
age, BMI, exercise, blood lipids, and blood pressure.
4.3.2 Recommendations for Improvement
Further analysis shows that the prediction accuracy
of the grey prediction model for diabetes incidence is
mainly affected by the following factors: one is the
quality and completeness of the data. Because the
model depends on historical data, the accuracy and
completeness of the data are very important to the
prediction result. If the data are missing or
inaccurate, the prediction result may be biased. The
other is the assumptions of the model. The grey
prediction model assumes that the trend of the data is
stable, but in some years, unexpected events or other
external factors may cause large fluctuations in the
incidence rate, affecting the accuracy of the model.
Therefore, to cope with the violent fluctuations
and abrupt changes that may occur in the prediction
process, it is necessary to further improve the model
(Xie et al., 2009). Based on the grey prediction
model, age, BMI, exercise condition, blood pressure,
and blood lipid factors closely related to diabetes
incidence are introduced to carry out joint modeling
and establish a multivariate grey prediction model.
The second is model fusion and ensemble learning.
Combined with random forest, support vector
machine, and other machine learning models,
prediction is carried out by the ensemble learning
method (Zhang et al., 2020). Thus, the possible
overfitting problem of a single model can be
reduced, and the accuracy and robustness of
prediction can be improved. Third, cross-validation
and model optimization can be carried out. Dividing
the data set into multiple training sets and validation
sets and conducting multiple model training and
testing can effectively avoid the overfitting
phenomenon and selecting the optimal model
parameters to improve the prediction accuracy of the
grey prediction model.
Through the above optimization measures, the
accuracy and stability of the grey prediction model
in predicting the incidence of diabetes can be
effectively improved.
5 CONCLUSION
The results show that the grey prediction model can
capture the trend of diabetes incidence well, but it
has some shortcomings in prediction accuracy in
high-incidence years. This paper proposes some
optimization measures such as introducing external
variables, model fusion, and cross validation to
improve the prediction accuracy and stability of the
model. Further analysis shows that high age, high
Body Mass Index, lack of exercise, hypertension and
hyperlipidemia are the main risk factors for diabetes
incidence. There are differences in prevalence
between men and women. Overall, the incidence of
type 2 diabetes increases faster in men, and the
incidence of type 1 diabetes increases significantly
in women. This gender difference indicates that
gender-specific interventions are needed in diabetes
prevention and control. The results show that a
healthy lifestyle plays a key role in diabetes
prevention. Through regular physical examination,
healthy diet, regular exercise, and avoidance of
smoking, the risk of diabetes can be effectively
reduced, especially for high-risk groups with
hypertension and hyperlipidemia.
Based on the above findings, it is recommended
that individuals reduce the probability of developing
diabetes through regular physical examination and
exercise, especially those with hypertension and
hyperlipidemia, and strengthen prevention.
Communities should strengthen personal health
education and raise awareness of healthy lifestyles
and eating habits. In addition, governments can
strengthen cooperation with medical institutions to
carry out early screening and intervention for
high-risk groups to reduce the prevalence of diabetes
and related complications.
REFERENCES
Ahmed, M. A., El-Medany, W. M. & Fathy, S. M. 2020.
Machine learning algorithms for predicting diabetes
risk: A review. International Journal of Computer
Applications, 175(5), 18-26.
Chaurasia, V. & Pal, S. 2017. Diabetes disease prediction
using data mining techniques. 2nd International
Conference on Communication and Electronics
Systems (ICCES), 283-287.
Liu, C., Guo, M., Yuan, Y., Zheng, J. & Yang, J. 2023.
Gender difference and prediction of type 1 diabetes
incidence in Beijing from 2013 to 2018. Journal of
Capital Medical University, 44(03), 370-374.
Liu, S. & Forrest, J. Y. L. 2007. Advances in grey systems
research. Springer.
Wei, F. 2016. Research on grey prediction model in blood
sugar prediction. Master's Thesis, Zhengzhou
University.