LAMV: Learning to Predict Where Spectators Look in Live Music Performances

Arturo Fuentes, F. Javier Sánchez, Thomas Voncina, Jorge Bernal

2021

Abstract

Artificial intelligence has changed how many everyday work tasks are performed. The analysis of cultural content has been greatly boosted by the development of computer-assisted methods that allow easy and transparent data access. In our case, we deal with automating the production of live shows, such as music concerts, aiming to develop a system that can indicate to the producer which camera to show based on what each camera is capturing. In this context, we consider it essential to understand where spectators look and what they are interested in, so that the computational method can learn from this information. The work presented here reports the results of a preliminary study in which we compare areas of interest defined by human observers with those indicated by an automatic system. Our system extracts motion textures from dynamic Spatio-Temporal Volumes (STV) and then analyzes the resulting patterns by means of texture analysis techniques. We validate our approach on several video sequences that have been labeled by 16 different experts. Our method is able to match the relevant areas identified by the experts, achieving recall scores higher than 80% when a distance of 80 pixels between method and ground truth is considered. Current performance shows promise for detecting abnormal peaks and movement trends.
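
The recall figure quoted above depends on a distance tolerance, which can be illustrated with a short sketch. The abstract does not give the exact matching rule, so this example assumes that areas of interest are reduced to (x, y) centre points and that a ground-truth point counts as matched when at least one predicted point lies within 80 pixels (Euclidean distance). The function name `recall_within_distance` and the sample coordinates are hypothetical and are not taken from the paper.

```python
import math


def recall_within_distance(predicted, ground_truth, max_dist=80.0):
    """Fraction of ground-truth points that have a prediction within max_dist pixels.

    Assumption (not from the paper): points are (x, y) centres and matching
    uses plain Euclidean distance.
    """
    if not ground_truth:
        return 0.0
    hits = 0
    for gx, gy in ground_truth:
        # A ground-truth point is a hit if any prediction is close enough.
        if any(math.hypot(gx - px, gy - py) <= max_dist for px, py in predicted):
            hits += 1
    return hits / len(ground_truth)


# Hypothetical example data: three expert-labelled points, three predictions.
ground_truth = [(100, 100), (400, 300), (800, 600)]
predicted = [(130, 140), (420, 310), (50, 700)]

# Two of the three expert points have a prediction within 80 pixels.
recall = recall_within_distance(predicted, ground_truth)
print(f"recall at 80 px: {recall:.2f}")
```

Under this reading, loosening the tolerance can only keep or raise the recall, which is why the score has to be reported together with the distance it was computed at.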


Paper Citation


in Harvard Style

Fuentes A., Sánchez F., Voncina T. and Bernal J. (2021). LAMV: Learning to Predict Where Spectators Look in Live Music Performances. In Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2021) - Volume 5: VISAPP; ISBN 978-989-758-488-6, SciTePress, pages 500-507. DOI: 10.5220/0010254005000507


in Bibtex Style

@conference{visapp21,
author={Arturo Fuentes and F. Javier Sánchez and Thomas Voncina and Jorge Bernal},
title={LAMV: Learning to Predict Where Spectators Look in Live Music Performances},
booktitle={Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2021) - Volume 5: VISAPP},
year={2021},
pages={500-507},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010254005000507},
isbn={978-989-758-488-6},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2021) - Volume 5: VISAPP
TI - LAMV: Learning to Predict Where Spectators Look in Live Music Performances
SN - 978-989-758-488-6
AU - Fuentes A.
AU - Sánchez F.
AU - Voncina T.
AU - Bernal J.
PY - 2021
SP - 500
EP - 507
DO - 10.5220/0010254005000507
PB - SciTePress