loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Ryota Sato 1 ; Suzana Beleza 1 ; Erica Shimomoto 2 ; Matheus Silva de Lima 1 ; Nobuko Kato 3 and Kazuhiro Fukui 1

Affiliations: 1 University of Tsukuba, Department of Computer Science, Tsukuba, Ibaraki, Japan ; 2 National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, Japan ; 3 Tsukuba University of Technology, Faculty of Industrial Technology, Tsukuba, Ibaraki, Japan

Keyword(s): Sign Language Recognition, 3D Fast Fourier Transform, Product Grassmann Manifold, Subspace-Based Methods.

Abstract: This paper proposes a subspace-based method for sign language recognition in videos. Typical subspace-based methods represent a video as a low-dimensional subspace generated by applying principal component analysis (PCA) to a set of images from the video. Such representation is compact and practical for motion recognition under few learning data. However, given the complex motion and structure in sign languages, subspace-based methods need to improve performance as they do not consider temporal information like the order of frames. To address this issue, we propose processing time-domain information on the frequency-domain by applying the three-dimensional fast Fourier transform (3D-FFT) to sign videos, where a sign video is represented as a 3D amplitude spectrum tensor, which is invariant to deviations in the spatial and temporal directions of target objects. Further, a 3D amplitude spectral tensor is regarded as one point on the Product Grassmann Manifold (PGM). By unfolding the te nsor in all three dimensions, PGM can account for the temporal information. Finally, we calculate video similarity by using the distances between two corresponding points on the PGM. The effectiveness of the proposed method is demonstrated on private and public sign language recognition datasets, showing a significant performance improvement over conventional subspace-based methods. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.136.17.105

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Sato, R.; Beleza, S.; Shimomoto, E.; Silva de Lima, M.; Kato, N. and Fukui, K. (2024). Sign Language Recognition Based on Subspace Representations in the Spatio-Temporal Frequency Domain. In Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods - ICPRAM; ISBN 978-989-758-684-2; ISSN 2184-4313, SciTePress, pages 152-159. DOI: 10.5220/0012577000003654

@conference{icpram24,
author={Ryota Sato. and Suzana Beleza. and Erica Shimomoto. and Matheus {Silva de Lima}. and Nobuko Kato. and Kazuhiro Fukui.},
title={Sign Language Recognition Based on Subspace Representations in the Spatio-Temporal Frequency Domain},
booktitle={Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods - ICPRAM},
year={2024},
pages={152-159},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012577000003654},
isbn={978-989-758-684-2},
issn={2184-4313},
}

TY - CONF

JO - Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods - ICPRAM
TI - Sign Language Recognition Based on Subspace Representations in the Spatio-Temporal Frequency Domain
SN - 978-989-758-684-2
IS - 2184-4313
AU - Sato, R.
AU - Beleza, S.
AU - Shimomoto, E.
AU - Silva de Lima, M.
AU - Kato, N.
AU - Fukui, K.
PY - 2024
SP - 152
EP - 159
DO - 10.5220/0012577000003654
PB - SciTePress