An increase in home-field advantage (+5%) resulted
in a 0.02 increase in predicted medal counts,
representing approximately a 20% relative increase.
Conversely, a reduction of the home-field advantage
(-5%) led to a 0.01 decrease in predicted medal counts,
corresponding to a 10% reduction. These findings
confirm the importance of home-field advantage,
albeit with a lower sensitivity compared to participant
numbers.
To sum up, the model maintains stable prediction
trajectories over multiple input scenarios, also,
reasonably responding to the changes of the key
variables. That means it further demonstrates the
robustness and adaptability of the model. What’s
more, the mode of the sensitivity test curves matches
the actual situation, confirming the overall reliability
of the results.
5.3 Advantage and Disadvantage
5.3.1 Advantage
The model incorporates multiple dimensions of
features, such as historical performance, sports
participation, athlete factors, and host country effects.
This multi-dimensional approach allows the model to
consider a wide range of influencing factors,
enhancing both the accuracy and comprehensiveness
of its predictions.
XGBoost, with its strong regularization
capabilities, is particularly well-suited for handling
large-scale datasets and reducing overfitting.
Moreover, it excels at capturing non-linear
relationships and complex interactions between
features. Based on the provided training and testing
results, the model achieved an R² of 0.986 on the
training set and 0.827 on the test set, demonstrating
excellent performance. These results indicate that the
model not only fits the training data well but also
generalizes effectively to unseen data.
5.3.2 Disadvantage
The model depends heavily on a substantial amount
of historical data, athlete participation, and event
details. Missing or inaccurate data in any of these
areas could cause instability or reduce the accuracy of
the model’s predictions.
6 CONCLUSION
In conclusion, this study aimed to address the
challenge of predicting Olympic medal counts. The
paper constructed the HEAH model framework,
incorporating historical performance, engagement in
sports, athlete factors, and host effect. By integrating
the XGBoost algorithm with hyperparameters tuned
by TPE optimization, our model demonstrated
excellent performance, achieving an 𝑅
of 0.986 on
the training set and 0.827 on the test set. The model
successfully provided predictions for medal tables,
identified countries with potential improvement or
decline, predicted first-time medallists, and analysed
the relationship between events and medals.
However, it has limitations, such as reliance on
accurate data. Overall, this research offers valuable
insights into Olympic medal prediction and can serve
as a reference for future studies in sports performance
prediction.
REFERENCES
Baimbridge, M. 1998. Outcome uncertainty in sporting
competition: the Olympic Games 1896–1996. Applied
Economics Letters, 5(3), 161-164.
Bergstra, J., Bardenet, R., Bengio, Y., & Kégl, B. 2011.
Algorithms for hyper-parameter optimization. Advances
in neural information processing systems, 24.
Cetinkaya, A., Peker, S., & Kuvvetli, Ü. 2024. Analysis of
countries' performances in individual Olympic Games
using cluster analysis and decision trees: the case of
Tokyo 2020. Sport, Business and Management: An
International Journal, 14(5/6), 648-666.
Chen, T., & Guestrin, C. 2016. Xgboost: A scalable tree
boosting system. In Proceedings of the 22nd acm sigkdd
international conference on knowledge discovery and
data mining (pp. 785-794).
Csurilla, G., & Fertő, I. 2024. How to win the first Olympic
medal? And the second? Social Science
Quarterly, 105(5), 1544-1564.
Donald, W. B. 1972. Olympic games competition:
structural correlates of national success. International
Journal of Comparative Sociology, 13, 186.
Kuper, G. H., & Sterken, E. 2001. Olympic participation
and performance since 1896. Available at SSRN
274295.
Leeds, E. M. 2019. Olympic Performance. The SAGE
Handbook of Sports Economics, 377.
Malfas, M., Theodoraki, E., & Houlihan, B. 2004. Impacts
of the Olympic Games as mega-events. In Proceedings
of the Institution of Civil Engineers-Municipal
Engineer (Vol. 157, No. 3, pp. 209-220). Thomas
Telford Ltd.
Sagala, N. T., & Ibrahim, M. A. 2022. A Comparative
Study of Different Boosting Algorithms for Predicting
Olympic Medal. In 2022 IEEE 8th International
Conference on Computing, Engineering nd Design
(ICCED) (pp. 1-4). IEEE.
Schlembach, C., Schmidt, S. L., Schreyer, D., &
Wunderlich, L. 2022. Forecasting the Olympic medal