Fine-tuning Siamese Networks to Assess Sport Gestures Quality

Mégane Millan, Catherine Achard

2020

Abstract

This paper presents an Action Quality Assessment (AQA) approach that learns to automatically score how well an action is performed from temporal sequences such as videos. To cope with the small size of most databases capturing actions or gestures, we propose to use Siamese Networks. In the literature, Siamese Networks are widely used to rank action scores: their purpose is not to regress scores but to predict a value that respects the order of the true scores, so that it can be used to rank actions according to their quality. For AQA, we need to predict real scores, as well as the difference between these scores and their range. Thus, we first introduce a new loss function to train Siamese Networks to regress score gaps. Once the Siamese network is trained, one branch of the network is extracted and fine-tuned for score prediction. We tested our approach on a public database, the AQA-7 dataset, composed of videos from 7 sports. Our results outperform the state of the art on the AQA task. Moreover, we show that the proposed method is also more efficient for action ranking.
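The two-stage idea in the abstract (train a Siamese network on score gaps, then fine-tune one branch on absolute scores) can be illustrated with a toy sketch. The abstract does not give the form of the loss, so the squared error on score gaps used here is an assumption, not the authors' loss. The linear `branch`, the synthetic `true_score` function, and all hyperparameters are likewise hypothetical stand-ins for a deep network and real video features.

```python
import random

def branch(w, b, x):
    """Shared scoring branch: a hypothetical linear model standing in for a deep network."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b

def train_siamese(pairs, dim, lr=0.1, steps=300):
    """Stage 1: fit the predicted gap branch(xa) - branch(xb) to the true gap sa - sb.

    Assumed loss: mean squared error on score gaps.
    """
    w, b = [0.0] * dim, 0.0
    for _ in range(steps):
        grad = [0.0] * dim
        for xa, sa, xb, sb in pairs:
            err = (branch(w, b, xa) - branch(w, b, xb)) - (sa - sb)
            for i in range(dim):
                grad[i] += 2 * err * (xa[i] - xb[i]) / len(pairs)
        w = [wi - lr * gi for wi, gi in zip(w, grad)]
        # The bias b cancels in the difference, so this loss never updates it.
    return w, b

def fine_tune(samples, w, b, lr=0.1, steps=500):
    """Stage 2: fine-tune the extracted branch on absolute scores (mean squared error)."""
    w = list(w)
    for _ in range(steps):
        gw, gb = [0.0] * len(w), 0.0
        for x, s in samples:
            err = branch(w, b, x) - s
            gb += 2 * err / len(samples)
            for i in range(len(w)):
                gw[i] += 2 * err * x[i] / len(samples)
        w = [wi - lr * gi for wi, gi in zip(w, gw)]
        b -= lr * gb
    return w, b

def true_score(x):
    """Synthetic ground-truth score, invented for this illustration only."""
    return 2.0 * x[0] - 1.0 * x[1] + 5.0

random.seed(0)
features = [[random.uniform(-1, 1), random.uniform(-1, 1)] for _ in range(80)]
samples = [(x, true_score(x)) for x in features]
# Pair the first half of the samples with the second half.
pairs = [(xa, sa, xb, sb) for (xa, sa), (xb, sb) in zip(samples[:40], samples[40:])]

w_s, b_s = train_siamese(pairs, dim=2)
w_f, b_f = fine_tune(samples, w_s, b_s)
print("after gap training:", w_s, b_s)
print("after fine-tuning: ", w_f, b_f)
```

In this toy setting, gap training recovers how the score varies with the input but leaves the constant offset undetermined, because the offset cancels in any difference of two branch outputs. Fine-tuning on absolute scores is what fixes it, which is one way to read the abstract's motivation for the second stage.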

Paper Citation


in Harvard Style

Millan M. and Achard C. (2020). Fine-tuning Siamese Networks to Assess Sport Gestures Quality. In Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2020) - Volume 5: VISAPP; ISBN 978-989-758-402-2, SciTePress, pages 57-65. DOI: 10.5220/0008924600570065


in Bibtex Style

@conference{visapp20,
author={Mégane Millan and Catherine Achard},
title={Fine-tuning Siamese Networks to Assess Sport Gestures Quality},
booktitle={Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2020) - Volume 5: VISAPP},
year={2020},
pages={57-65},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008924600570065},
isbn={978-989-758-402-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2020) - Volume 5: VISAPP
TI - Fine-tuning Siamese Networks to Assess Sport Gestures Quality
SN - 978-989-758-402-2
AU - Millan M.
AU - Achard C.
PY - 2020
SP - 57
EP - 65
DO - 10.5220/0008924600570065
PB - SciTePress