loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Multi-stage RGB-based Transfer Learning Pipeline for Hand Activity Recognition

Topics: Deep Learning for Visual Understanding ; Egocentric Vision for Interaction Understanding; Event and Human Activity Recognition; Features Extraction; Human and Computer Interaction; Machine Learning Technologies for Vision; Understanding from Wearable and Mobile Cameras; Visual Attention and Image Saliency

Authors: Yasser Boutaleb 1 ; 2 ; Catherine Soladie 1 ; Nam-Duong Duong 2 ; Jérôme Royan 2 and Renaud Seguier 1

Affiliations: 1 IETR/CentraleSupelec, Avenue de la Boulaie, 35510 Cesson-Sevigné, France ; 2 IRT b-com, 1219 Avenue des Champs Blancs, 35510 Cesson-Sevigné, France

Keyword(s): First-person Hand Activity Recognition, Transfer Learning, Multi-stream Learning, Features Fusion.

Abstract: First-person hand activity recognition is a challenging task, especially when not enough data are available. In this paper, we tackle this challenge by proposing a new low-cost multi-stage learning pipeline for first-person RGB-based hand activity recognition on a limited amount of data. For a given RGB image activity sequence, in the first stage, the regions of interest are extracted using a pre-trained neural network (NN). Then, in the second stage, high-level spatial features are extracted using pre-trained deep NN. In the third stage, the temporal dependencies are learned. Finally, in the last stage, a hand activity sequence classifier is learned, using a post-fusion strategy, which is applied to the previously learned temporal dependencies. The experiments evaluated on two real-world data sets shows that our pipeline achieves the state-of-the-art. Moreover, it shows that the proposed pipeline achieves good results on limited data.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.226.187.24

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Boutaleb, Y.; Soladie, C.; Duong, N.; Royan, J. and Seguier, R. (2022). Multi-stage RGB-based Transfer Learning Pipeline for Hand Activity Recognition. In Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 4: VISAPP; ISBN 978-989-758-555-5; ISSN 2184-4321, SciTePress, pages 839-848. DOI: 10.5220/0010856200003124

@conference{visapp22,
author={Yasser Boutaleb. and Catherine Soladie. and Nam{-}Duong Duong. and Jérôme Royan. and Renaud Seguier.},
title={Multi-stage RGB-based Transfer Learning Pipeline for Hand Activity Recognition},
booktitle={Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 4: VISAPP},
year={2022},
pages={839-848},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010856200003124},
isbn={978-989-758-555-5},
issn={2184-4321},
}

TY - CONF

JO - Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 4: VISAPP
TI - Multi-stage RGB-based Transfer Learning Pipeline for Hand Activity Recognition
SN - 978-989-758-555-5
IS - 2184-4321
AU - Boutaleb, Y.
AU - Soladie, C.
AU - Duong, N.
AU - Royan, J.
AU - Seguier, R.
PY - 2022
SP - 839
EP - 848
DO - 10.5220/0010856200003124
PB - SciTePress