Local and Global Feature Descriptors Combination from RGB-Depth Videos for Human Action Recognition

Rawya Al-Akam, Dietrich Paulus

Abstract

This paper attempts to present human action recognition through the combination of local and global feature descriptors values, which are extracted from RGB and Depth videos. A video sequence is represented as a collection of spatio and spatio-temporal features. However, the challenging problems exist in both local and global descriptors for classifying human actions. We proposed a novel combination of the two descriptor methods, 3D trajectory and motion boundary histogram for the local feature and global Gist feature descriptor for the global feature (3DTrMBGG). To solve the problems of the structural information among the local descriptors, and clutter background and occlusion among the global descriptor, the combination of the local and global features descriptor is used. In this paper, there are three novel combination steps of video descriptors. First, combines motion and 3D trajectory shape descriptors. Second, extract the structural information using global gist descriptor. Third, combines these two descriptor steps to get the 3DTrMBGG feature vector from spatio-temporal domains. The results of the 3DTrMBGG features are used along with the K-mean clustering and multi-class support vector machine classifier. Our new method on several video actions improves performance on actions even with low movement rate and outperforms the competing state-of-the-art -temporal feature-based human action recognition methods.

Download


Paper Citation


in Harvard Style

Al-Akam R. and Paulus D. (2018). Local and Global Feature Descriptors Combination from RGB-Depth Videos for Human Action Recognition.In Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-276-9, pages 265-272. DOI: 10.5220/0006525002650272


in Bibtex Style

@conference{icpram18,
author={Rawya Al-Akam and Dietrich Paulus},
title={Local and Global Feature Descriptors Combination from RGB-Depth Videos for Human Action Recognition},
booktitle={Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2018},
pages={265-272},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006525002650272},
isbn={978-989-758-276-9},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - Local and Global Feature Descriptors Combination from RGB-Depth Videos for Human Action Recognition
SN - 978-989-758-276-9
AU - Al-Akam R.
AU - Paulus D.
PY - 2018
SP - 265
EP - 272
DO - 10.5220/0006525002650272