End-to-end Global to Local CNN Learning for Hand Pose Recovery in Depth Data

Madadi, Meysam; Escalera, Sergio; Baro, Xavier; Gonzalez, Jordi

Computer Science > Computer Vision and Pattern Recognition

arXiv:1705.09606 (cs)

[Submitted on 26 May 2017 (v1), last revised 11 Apr 2018 (this version, v2)]

Title:End-to-end Global to Local CNN Learning for Hand Pose Recovery in Depth Data

Authors:Meysam Madadi, Sergio Escalera, Xavier Baro, Jordi Gonzalez

View PDF

Abstract:Despite recent advances in 3D pose estimation of human hands, especially thanks to the advent of CNNs and depth cameras, this task is still far from being solved. This is mainly due to the highly non-linear dynamics of fingers, which make hand model training a challenging task. In this paper, we exploit a novel hierarchical tree-like structured CNN, in which branches are trained to become specialized in predefined subsets of hand joints, called local poses. We further fuse local pose features, extracted from hierarchical CNN branches, to learn higher order dependencies among joints in the final pose by end-to-end training. Lastly, the loss function used is also defined to incorporate appearance and physical constraints about doable hand motion and deformation. Finally, we introduce a non-rigid data augmentation approach to increase the amount of training depth data. Experimental results suggest that feeding a tree-shaped CNN, specialized in local poses, into a fusion network for modeling joints correlations and dependencies, helps to increase the precision of final estimations, outperforming state-of-the-art results on NYU and SyntheticHand datasets.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1705.09606 [cs.CV]
	(or arXiv:1705.09606v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1705.09606

Submission history

From: Meysam Madadi [view email]
[v1] Fri, 26 May 2017 14:55:44 UTC (7,869 KB)
[v2] Wed, 11 Apr 2018 23:26:00 UTC (9,444 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2017-05

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Meysam Madadi
Sergio Escalera
Xavier Baró
Jordi Gonzàlez

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:End-to-end Global to Local CNN Learning for Hand Pose Recovery in Depth Data

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:End-to-end Global to Local CNN Learning for Hand Pose Recovery in Depth Data

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators