learning techniques: object detection for player 
detection, digit recognition for the jersey number 
identification, and similarity discrimination for re-
identification of the target player.  
Our evaluation results with nine players from 
three videos show that the precision of the target 
player identification is over 90% in all experiments. 
However, the recall performance could be improved.  
In future work, we will extract more sophisticated 
features for the target player identification, such as 
hair style for the case of miss-tracking and frame-out. 
In addition, we will develop a mechanism to improve 
the actual recall performance by considering the 
players’ position and by finding extremely crowded 
situations in order to stop tracking of the individual 
players but keep those frames in which the target 
player might be present in the summary video. 
REFERENCES 
Yang, Y., and Li, D. (2017). Robust player detection and 
tracking in broadcast soccer video based on enhanced 
particle filter. In Journal of Visual Communication and 
Image Representation, volume 46, pages 81-94. 
Liu, Z. J., Tong, X., Li, W., Wang, T., Zhang, Y., and 
Wang, H. (2009). Automatic player detection, labeling 
and tracking in broadcast soccer video. In Journal of 
Pattern Recognition Letters, volume 30, no 2, pages 
103-113. 
Ballan, L., Bertini, M., Bimbo, A., and Nunziati, W. (2007). 
Soccer players identification based on visual local 
features. In Proceedings of the 6th ACM International 
Conference on Image and Video Retrieval (CIVR), 
pages 258-265. 
Ye, Q., Huang, Q., Jiang, S., Liu, Y., and Gao, W. (2005). 
Jersey number detection in sports video for athlete 
identification. In Proceedings of the SPIE Visual 
Communications and Image Processing (VCIP), 
volume 5960, pages 1599-1606. 
Saric, M., Dujmic, H., Papic, V., and Rozic, N. (2008). 
Player number localization and recognition in soccer 
video using hsv color space and internal contours. In 
Proceedings of the 10th WSEAS International Con-
ference on Automation & Information (ICAI), pages 
175-180. 
Gerke, S., Müller, K., and Schäfer, R. (2015). Soccer jersey 
number recognition using convolutional neural 
networks. In IEEE International Conference on 
Computer Vision Workshops (ICCV), pages 17-24. 
Li, G., Xu, S., Liu, X., Li, L., and Wang, C. (2018). Jersey 
number recognition with semi-supervised spatial 
transformer network. In IEEE Conference on Computer 
Vision and Pattern Recognition Workshops (CVPR), 
pages 1896-1903. 
IFAB. International Football Association Board | IFAB. 
http://www.theifab.com/laws (2019-10-03 reference). 
Redmon, J., and Farhadi, A. (2018). YOLOv3: an 
incremental improvement. In arXiv:1804.02767. 
Netzer, Y., Wang, T., Coates, A., Bissacco, A., Wu, B., and 
Ng, A. (2011). Reading digits in natural images with 
unsupervised feature learning. In Neural Information 
Processing Systems Workshops, (NIPS), pages 1-9. 
Bromley, J., Guyon, I., LeCun, Y., Sickinger, E., and Shah, 
R. (1993). Signature verification using a "Siamese" 
time delay neural network. In Proceedings of the 6th 
International Conference on Neural Information 
Processing Systems (NIPS), pages 737-744. 
Koch, G., Zemel, R., and Salakhutdinov, R. (2015). 
Siamese neural networks for one-shot image 
recognition.  In Proceedings of the 32nd International 
Conference on Machine Learning Workshops (ICML), 
volume 37. 
Cao, Q., Shen, L., Xie, W., Parkhi, O. M, and Zisserman, 
A. (2018). VGGFace2: A dataset for recognising face 
across pose and age. In IEEE International Conference 
on Automatic Face and Gesture Recognition (FG). 
Kingma, D.P., and Ba, J. (2015). Adam: a method for 
stochastic optimization. In Proceedings of the 3rd 
International Conference for Learning Repre-
sentations (ICLR).