
Data-Driven Model to Measure Pass Effectiveness in
Professional Soccer Matches. Big Data, 7(1):57–70.
Huang, K. H., Chen, L., and Chang, K. W. (2020). Generat-
ing Sports News from Live Commentary: A Chinese
Dataset for Sports Game Summarization. In Proc. 1st
Conf. of the Asia-Pacific Chapter of the Ass. for Com-
putational Linguistics and the 10th Int. Joint Conf. on
Natural Language Processing, pages 609–615.
Huang, Y., Wan, L. J., Ye, H., Jha, M., Wang, J., Li, Y.,
Zhang, X., and Chen, D. (2024). Invited: New Solu-
tions on LLM Acceleration, Optimization, and Appli-
cation. In Proceedings of the 61st ACM/IEEE Design
Automation Conference, pages 1–4.
Jehangir, B., Radhakrishnan, S., and Agarwal, R. (2023).
A survey on Named Entity Recognition — datasets,
tools, and methodologies. Natural Language Process-
ing Journal, 3:1–12.
Jeong, C. (2024). Domain-specialized LLM: Finan-
cial fine-tuning and utilization method using Mistral
7B. Journal of Intelligence and Information Systems,
30(1):93–120.
Jiang, L., Jiang, K., Chu, X., Gulati, S., and Garg, P.
(2024). Hallucination Detection in LLM-enriched
Product Listings. In Malmasi, S., Fetahu, B., Ueffing,
N., Rokhlenko, O., Agichtein, E., and Guy, I., editors,
Proc. Seventh Workshop on e-Commerce and NLP @
LREC-COLING, pages 29–39.
Kunert, J. (2020). Automation in Sports Reporting: Strate-
gies of Data Providers, Software Providers, and Media
Outlets. Media and Communication, 8(3):1–11.
Lakomkin, E., Wu, C., Fathullah, Y., Kalinli, O., Seltzer,
M. L., and Fuegen, C. (2024). End-to-End Speech
Recognition Contextualization with Large Language
Models. In ICASSP 2024 - 2024 IEEE International
Conference on Acoustics, Speech and Signal Process-
ing (ICASSP), pages 12406–12410.
Li, H., Chi, H., Liu, M., and Yang, W. (2024). Look Within,
Why LLMs Hallucinate: A Causal Perspective.
Lin, C.-Y. and Och, F. J. (2004). Automatic evaluation
of machine translation quality using longest common
subsequence and skip-bigram statistics. In Proceed-
ings of the 42nd Annual Meeting on Association for
Computational Linguistics - ACL ’04, pages 1–8.
Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D.,
Levy, O., Lewis, M., Zettlemoyer, L., and Stoyanov,
V. (2019). RoBERTa: A Robustly Optimized BERT
Pretraining Approach.
L
¨
ochtefeld, M., J
¨
ackel, C., and Kr
¨
uger, A. (2015). TwitSoc-
cer: Knowledge-Based Crowd-Sourcing of Live Soc-
cer Events. In Proceedings of the 14th International
Conference on MUM, pages 1–4.
Mehta, R. and Varma, V. (2023). LLM-RM at SemEval-
2023 Task 2: Multilingual Complex NER using XLM-
RoBERTa.
Min, Z. and Wang, J. (2024). ”Exploring the Integration
of Large Language Models into Automatic Speech
Recognition Systems: An Empirical Study”. In Luo,
B., Cheng, L., Wu, Z.-G., Li, H., and Li, C., editors,
Neural Information Processing, pages 69–84.
Moreno-Barea, F. J., Jerez, J. M., and Franco, L. (2020).
Improving classification accuracy using data augmen-
tation on small data sets. Expert Systems with Appli-
cations, 161:1–14.
Ojomo, O. W. and Olomojobi, O. T. (2021). Viewing
the Game Textually: Online Consumption of Live
Text Commentary as Alternate Spectatorship Among
Nigerian Football Fans. Communication & Sport,
9(3):496–521.
OpenAI (2025). Introducing Whisper.
https://openai.com/index/whisper/. Accessed:
15.01.2025.
Radford, A., Kim, J. W., Xu, T., Brockman, G., Mcleavey,
C., and Sutskever, I. (2023). Robust speech recog-
nition via large-scale weak supervision. In Proceed-
ings of the 40th International Conference on Machine
Learning, volume 202 of Proceedings of Machine
Learning Research, pages 28492–28518.
Sarkhoosh, M. H., Gautam, S., Midoglu, C., Sabet, S. S.,
Torjusen, T., and Halvorsen, P. (2024). The Soccer-
Sum Dataset for Automated Detection, Segmentation,
and Tracking of Objects on the Soccer Pitch. In Pro-
ceedings of the 15th ACM Multimedia Systems Con-
ference, MMSys ’24, page 353–359.
Strand, A. T., Gautam, S., Midoglu, C., and Halvorsen, P.
(2024). SoccerRAG: Multimodal Soccer Information
Retrieval via Natural Queries. arXiv.
Tasnim, M., Collarana, D., Graux, D., Galkin, M., and
Vidal, M.-E. (2019). COMET: A Contextualized
Molecule-Based Matching Technique. In Lecture
Notes in Computer Science, pages 175–185.
Tonmoy, S. M. T. I., Zaman, S. M. M., Jain, V., Rani, A.,
Rawte, V., Chadha, A., and Das, A. (2024). A Com-
prehensive Survey of Hallucination Mitigation Tech-
niques in Large Language Models.
Tran, N., Tran, H., Nguyen, S., Nguyen, H., and Nguyen, T.
(2019). Does BLEU Score Work for Code Migration?
In 2019 IEEE/ACM 27th International Conference on
Program Comprehension (ICPC), pages 165–176.
Transfermarkt (2025). transfermarkt.com.
https://www.transfermarkt.de/. Accessed:
12.04.2025.
Tuyls, K., Omidshafiei, S., Muller, P., Wang, Z., Connor, J.,
Hennes, D., Graham, I., and et al. (2021). Game Plan:
What AI can do for Football, and What Football can
do for AI. JAIR, 71:41–88.
Wills, S., Bai, Y., Tejedor-Garc
´
ıa, C., Cucchiarini, C., and
Strik, H. (2023). Automatic Speech Recognition of
Non-Native Child Speech for Language Learning Ap-
plications (Short Paper). In arXiv:2306.16710, pages
1–8.
Yuan, W., Neubig, G., and Liu, P. (2021). BARTScore:
Evaluating generated text as text generation. In Ran-
zato, M., Beygelzimer, A., Dauphin, Y., Liang, P., and
Vaughan, J. W., editors, Advances in Neural Informa-
tion Processing Systems, volume 34, pages 27263–
27277.
Zhang, T., Kishore, V., Wu, F., Weinberger, K. Q., and
Artzi, Y. (2020). BERTScore: Evaluating Text Gener-
ation with BERT.
A Novel Approach to Automated Live-Ticker Generation in Football: Using Large Language Models and Audio Data
141