End‐to‐end global to local convolutional neural network learning for hand pose recovery in depth data. Issue 1 (12th August 2021)
- Record Type:
- Journal Article
- Title:
- End‐to‐end global to local convolutional neural network learning for hand pose recovery in depth data. Issue 1 (12th August 2021)
- Main Title:
- End‐to‐end global to local convolutional neural network learning for hand pose recovery in depth data
- Authors:
- Madadi, Meysam
Escalera, Sergio
Baró, Xavier
Gonzàlez, Jordi - Abstract:
- Abstract: Despite recent advances in 3‐D pose estimation of human hands, thanks to the advent of convolutional neural networks (CNNs) and depth cameras, this task is still far from being solved in uncontrolled setups. This is mainly due to the highly non‐linear dynamics of fingers and self‐occlusions, which make hand model training a challenging task. In this study, a novel hierarchical tree‐like structured CNN is exploited, in which branches are trained to become specialised in predefined subsets of hand joints called local poses. Further, local pose features, extracted from hierarchical CNN branches, are fused to learn higher order dependencies among joints in the final pose by end‐to‐end training. Lastly, the loss function used is also defined to incorporate appearance and physical constraints about doable hand motions and deformations. Finally, a non‐rigid data augmentation approach is introduced to increase the amount of training depth data. Experimental results suggest that feeding a tree‐shaped CNN, specialised in local poses, into a fusion network for modelling joints' correlations and dependencies, helps to increase the precision of final estimations, showing competitive results on NYU, MSRA, Hands17 and SyntheticHand datasets.
- Is Part Of:
- IET computer vision. Volume 16:Issue 1(2022)
- Journal:
- IET computer vision
- Issue:
- Volume 16:Issue 1(2022)
- Issue Display:
- Volume 16, Issue 1 (2022)
- Year:
- 2022
- Volume:
- 16
- Issue:
- 1
- Issue Sort Value:
- 2022-0016-0001-0000
- Page Start:
- 50
- Page End:
- 66
- Publication Date:
- 2021-08-12
- Subjects:
- computer vision -- data acquisition -- human computer interaction -- learning (artificial intelligence) -- pose estimation
convolutional neural nets -- stereo image processing
Computer vision -- Periodicals
Pattern recognition systems -- Periodicals
006.37 - Journal URLs:
- http://digital-library.theiet.org/content/journals/iet-cvi ↗
http://www.ietdl.org/IET-CVI ↗
https://ietresearch.onlinelibrary.wiley.com/journal/17519640 ↗
http://www.theiet.org/ ↗ - DOI:
- 10.1049/cvi2.12064 ↗
- Languages:
- English
- ISSNs:
- 1751-9632
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4363.252250
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 26270.xml