Authors: Monika Wysoczańska 1,2 and Tomasz Trzciński 3,2

Affiliations: 1 Sport Algorithmics and Gaming, Poland; 2 Warsaw University of Technology, Poland; 3 Tooploox, Poland

Keyword(s): Multimodal Learning, Activity Recognition, Music Genre Classification, Multimodal Fusion.

Abstract: Video content analysis is still an emerging technology, and the majority of work in this area extends from the still image domain. Dance videos are especially difficult to analyse and recognise as the performed human actions are highly dynamic. In this work, we introduce a multimodal approach for dance video recognition. Our proposed method combines visual and audio information, by fusing their representations, to improve classification accuracy. For the visual part, we focus on motion representation, as it is the key factor in distinguishing dance styles. For audio representation, we put the emphasis on capturing long-term dependencies, such as tempo, which is a crucial dance discriminator. Finally, we fuse two distinct modalities using a late fusion approach. We compare our model with corresponding unimodal approaches, by giving exhaustive evaluation on the Let’s Dance dataset. Our method yields significantly better results than each single-modality approach. Results presented in this work not only demonstrate the strength of integrating complementary sources of information in the recognition task, but also indicate the potential of applying multimodal approaches within specific research areas.
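
As an illustration of the late fusion idea mentioned in the abstract, the sketch below shows one common score-level variant: each modality produces its own class scores, and the resulting probabilities are combined by a weighted average. This is a minimal sketch under stated assumptions, not the authors' implementation; the abstract does not specify the fusion rule, and the paper may instead combine learned representations. The function names, the `visual_weight` parameter, and the example logits are all hypothetical.

```python
import math


def softmax(logits):
    """Convert raw class scores into probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def late_fusion(visual_logits, audio_logits, visual_weight=0.5):
    """Weighted average of per-modality class probabilities.

    Assumption: both unimodal models score the same ordered set of classes.
    visual_weight is a hypothetical hyperparameter in [0, 1].
    """
    if len(visual_logits) != len(audio_logits):
        raise ValueError("both modalities must score the same classes")
    if not 0.0 <= visual_weight <= 1.0:
        raise ValueError("visual_weight must lie in [0, 1]")
    p_visual = softmax(visual_logits)
    p_audio = softmax(audio_logits)
    return [
        visual_weight * v + (1.0 - visual_weight) * a
        for v, a in zip(p_visual, p_audio)
    ]


def predict(probabilities):
    """Return the index of the most probable class."""
    return max(range(len(probabilities)), key=probabilities.__getitem__)


# Made-up scores for three dance classes, for illustration only.
visual_logits = [2.0, 1.0, 0.1]   # visual model favours class 0
audio_logits = [0.5, 2.5, 0.3]    # audio model favours class 1
fused = late_fusion(visual_logits, audio_logits, visual_weight=0.5)
fused_class = predict(fused)
```

In this made-up example the audio model is more confident than the visual one, so the equally weighted fusion follows the audio prediction; setting `visual_weight=1.0` would reduce the result to the visual-only prediction.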

CC BY-NC-ND 4.0

Paper citation in several formats:
Wysoczańska, M. and Trzciński, T. (2020). Multimodal Dance Recognition. In Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2020) - Volume 5: VISAPP; ISBN 978-989-758-402-2; ISSN 2184-4321, SciTePress, pages 558-565. DOI: 10.5220/0009326005580565

@conference{visapp20,
author={Monika Wysoczańska and Tomasz Trzciński},
title={Multimodal Dance Recognition},
booktitle={Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2020) - Volume 5: VISAPP},
year={2020},
pages={558-565},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0009326005580565},
isbn={978-989-758-402-2},
issn={2184-4321},
}

TY - CONF

JO - Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2020) - Volume 5: VISAPP
TI - Multimodal Dance Recognition
SN - 978-989-758-402-2
IS - 2184-4321
AU - Wysoczańska, M.
AU - Trzciński, T.
PY - 2020
SP - 558
EP - 565
DO - 10.5220/0009326005580565
PB - SciTePress