stadiums,  for  example  two  groups  of  fans  of  both 
teams  of  the  soccer  match,  methods  of  group 
detection  in  crowded  scenes  could  be  also  useful 
(Pandey et al., 2020). 
The most significant advantage of audience shot 
detection is the opportunity to analyze the behavior of 
spectators, their club clothes, flags and banners, and 
even their gestures and expressions of joy, and then 
categorize and better annotate sports videos. 
REFERENCES 
Assfalg,  J., Bertini, M.,  Colombo,  C.,  and  Del Bimbo  A. 
(2002).  Semantic  annotation  of  sports  videos,  IEEE 
Multimedia, 9 (2): 52–60. 
Bertini  M.,  Del  Bimbo,  A.,  and  Nunziati,  W.  (2005). 
Automatic annotation of sport video content, in: Lazo 
M. and  Sanfeliu  A. (Eds.),  CIARP,  LNCS, vol.  3773, 
pp. 1066–1078. 
Choroś  K.  (2012).  Video  structure  analysis  for  content-
based indexing and categorisation of TV sports news. 
International Journal of Intelligent Information and 
Database Systems, 6(5): 451–465. 
Choroś K. (2016). Weighted indexing of TV  sports news 
videos.  Multimedia Tools and Applications,  75(24): 
16923–16942. 
Choroś,  K.  (2010).  Video  structure  analysis  and  content-
based indexing in  the  Automatic Video  Indexer AVI, 
in: Advances in Multimedia and Network Information 
System Technologies, Advances in Intelligent and Soft 
Computing, Springer, AISC, vol. 80, pp. 79–90. 
Choroś, K. (2012). Detection of tennis court lines for sport 
video  categorization,  in:  Computational Collective 
Intelligence. Technologies and Applications, Springer, 
LNCS, vol. 7654, pp. 304–314. 
Daudpota,  S.M.,  Muhammad,  A.,  and  Baber,  J.  (2019). 
Video genre identification using clustering-based shot 
detection  algorithm.  Signal, Image and Video 
Processing, 13(7): 1413–1420. 
Ekin,  A.,  Tekalp,  A.M.,  and  Mehrotra,  R.  (2003). 
Automatic  soccer  video  analysis  and  summarization. 
IEEE Transactions on Image Processing, 12(7): 796–
807. 
Fang, T., and Ping, S. (2013). Attractive events detection in 
soccer  videos  based  on  identification  of  shots,”  in: 
Proceedings of the 3rd International Conference on 
Multimedia Technology ICMT-13,  Atlantis  Press, 
pp. 814–822. 
Jiang,  H.,  and  Zhang,  M.  (2011).  Tennis  video  shot 
classification  based  on  support  vector  machine,  in: 
Proceedings of the IEEE International Conference on 
Computer Science and Automation Engineering, IEEE,  
vol. 2, pp. 757–761.  
Kuo, C.M., Chang, W.H., Fang, M.Y., and Lin, C.H. (2011) 
A  template-based  baseball  video  scene  classification 
using  efficient  playfield  segmentation.  Multimedia 
Tools and Applications, 55(3): 399–422. 
Li, Y., and Dorai, C. (2005). Video frame identification for 
learning media content understanding, in: Proceedings 
of the IEEE International Conference on Multimedia 
and Expo, IEEE, pp. 1488–1491. 
Mita, T., Kaneko, T., and Hori, O. (2005). Joint Haar-like 
features for face detection, in: Proceedings of the Tenth 
IEEE International Conference on Computer Vision 
(ICCV'05), IEEE, vol. 2, pp. 1619–1626. 
Mousse, M.A., Motamed, C., and Ezin, E.C. (2017). People 
counting  via  multiple  views  using  a  fast  information 
fusion  approach.  Multimedia Tools and Applications, 
76(5): 6801-6819. 
Ou, T.S., Huang, Y.H., and Chen, H.H. (2011). SSIM-based 
perceptual  rate  control  for  video  coding,  IEEE 
Transactions on Circuits and Systems for Video 
Technology, 21(5): 682–691. 
Pandey,  M.,  Singhal,  S.,  and  Tripathi,  V.  (2020).  An 
efficient  vision-based  group  detection  framework  in 
crowded scene. in: Frontiers in Intelligent Computing: 
Theory and Applications,  Springer,  AISC,  vol.  1014, 
pp. 201–209. 
Reisman, P., Mano, O., Avidan, S., and Shashua, A. (2004). 
Crowd detection in video sequences, in: Proceedings of 
the IEEE Intelligent Vehicles Symposium,  IEEE,  pp. 
66–71. 
Shih  H.-C.  (2017).  A  survey  of  content-aware  video 
analysis for sports. IEEE Transactions on Circuits and 
Systems for Video Technology, 28(5): 1212–1231. 
Thomas, G., Gade, R., Moeslund, T.B., Carr, P., and Hilton, 
A.  (2017).  Computer  vision  for  sports:  Current 
applications and research topics. Computer Vision and 
Image Understanding, 159: 3–18. 
Wang, Z., Bovik, A.C., Sheikh, H.R., and Simoncelli E.P. 
(2004). Image quality assessment: from error visibility 
to  structural  similarity.  IEEE Transactions on Image 
Processing, 13(4): 600–612. 
Wang, Z., Cheng, C., and Wang, X. (2018). A fast crowd 
segmentation  method,  in:  Proceedings of the 
International Conference on Audio, Language and 
Image Processing ICALIP, IEEE, pp. 242–245. 
Yakut, M., and  Kehtarnavaz, N. (2016).  Ice-hockey  puck 
detection  and  tracking  for  video  highlighting.  Signal, 
Image and Video Processing, 10(3): 527–533. 
Zhang, Y.Z., Dong, Q., Wang, J.Y., and Dai, Y.W. (2011). 
Court view shots detection based on Hough transform 
and  SVM,  in:  Proceedings of the International 
Workshop on Multi-Platform/Multi-Sensor Remote 
Sensing and Mapping, IEEE, pp. 1–4.