# Regression-based 3D Hand Pose Estimation using Heatmaps

### Chaitanya Bandi, Ulrike Thomas

#### 2020

#### Abstract

3D hand pose estimation is a challenging problem in human-machine interaction applications. We introduce a simple and effective approach for 3D hand pose estimation in grasping scenarios taking advantage of a low-cost RGB-D camera. 3D hand pose estimation plays a major role in an environment where objects are handed over between the human and robot hand to avoid collisions and to collaborate in shared workspaces. We consider Convolutional Neural Networks (CNNs) to determine a solution to our challenge. The idea of cascaded CNNs is very appropriate for real-time applications. In the paper, we introduce an architecture for direct 3D normalized coordinates regression and a small-scale dataset for human-machine interaction applications. In a cascaded network, the first network minimizes the search space, then the second network is trained within the confined region to detect more accurate 2D heatmaps of joint’s locations. Finally, 3D normalized joints are regressed directly on RGB images and depth maps can lift normalized coordinates to camera coordinates.

Download#### Paper Citation

#### in Harvard Style

Bandi C. and Thomas U. (2020). **Regression-based 3D Hand Pose Estimation using Heatmaps**. In *Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2020) - Volume 5: VISAPP*; ISBN 978-989-758-402-2, SciTePress, pages 636-643. DOI: 10.5220/0008973206360643

#### in Bibtex Style

@conference{visapp20,

author={Chaitanya Bandi and Ulrike Thomas},

title={Regression-based 3D Hand Pose Estimation using Heatmaps},

booktitle={Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2020) - Volume 5: VISAPP},

year={2020},

pages={636-643},

publisher={SciTePress},

organization={INSTICC},

doi={10.5220/0008973206360643},

isbn={978-989-758-402-2},

}

#### in EndNote Style

TY - CONF

JO - Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2020) - Volume 5: VISAPP

TI - Regression-based 3D Hand Pose Estimation using Heatmaps

SN - 978-989-758-402-2

AU - Bandi C.

AU - Thomas U.

PY - 2020

SP - 636

EP - 643

DO - 10.5220/0008973206360643

PB - SciTePress