Sports Analytics: Maximizing Precision in Predicting MLB Base Hits

Pedro Alceo, Roberto Henriques

2019

Abstract

As the world of sports expands to never seen levels, so does the necessity for tools which provided material advantages for organizations and other stakeholders. The main objective of this paper is to build a predictive model capable of predicting what are the odds of a baseball player getting a base hit on a given day, with the intention of both winning the game Beat the Streak and to provide valuable information for the coaching staff. Using baseball statistics, weather forecasts and ballpark characteristics several models were built with the CRISP-DM architecture. The main constraints considered when building the models were balancing, outliers, dimensionality reduction, variable selection and the type of algorithm – Logistic Regression, Multi-layer Perceptron, Random Forest and Stochastic Gradient Descent. The results obtained were positive, in which the best model was a Multi-layer Perceptron with an 85% correct pick ratio.

Download


Paper Citation


in Harvard Style

Alceo P. and Henriques R. (2019). Sports Analytics: Maximizing Precision in Predicting MLB Base Hits. In Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2019) - Volume 1: KDIR; ISBN 978-989-758-382-7, SciTePress, pages 190-201. DOI: 10.5220/0008362201900201


in Bibtex Style

@conference{kdir19,
author={Pedro Alceo and Roberto Henriques},
title={Sports Analytics: Maximizing Precision in Predicting MLB Base Hits},
booktitle={Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2019) - Volume 1: KDIR},
year={2019},
pages={190-201},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008362201900201},
isbn={978-989-758-382-7},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2019) - Volume 1: KDIR
TI - Sports Analytics: Maximizing Precision in Predicting MLB Base Hits
SN - 978-989-758-382-7
AU - Alceo P.
AU - Henriques R.
PY - 2019
SP - 190
EP - 201
DO - 10.5220/0008362201900201
PB - SciTePress