Convolutional Neural Networks and Long Short-Term Memory for skeleton-based human activity and hand gesture recognition. (April 2018)
- Record Type:
- Journal Article
- Title:
- Convolutional Neural Networks and Long Short-Term Memory for skeleton-based human activity and hand gesture recognition. (April 2018)
- Main Title:
- Convolutional Neural Networks and Long Short-Term Memory for skeleton-based human activity and hand gesture recognition
- Authors:
- Núñez, Juan C.
Cabido, Raúl
Pantrigo, Juan J.
Montemayor, Antonio S.
Vélez, José F. - Abstract:
- Highlights: Combination of a Convolutional Neural Network (CNN) and a Long Short-Term Memory (LSTM) recurrent network for skeleton-based human activity and hand gesture recognition. Two-stage training strategy which firstly focuses on the CNN training and, secondly, adjusts the full method CNN+LSTM. A method for data augmentation in the context of spatiotemporal 3D data sequences. An exhaustive experimental study on publicly available data benchmarks with respect to the state-of-the-art most representative methods. Comparison among different CPU and GPU platforms. Abstract: In this work, we address human activity and hand gesture recognition problems using 3D data sequences obtained from full-body and hand skeletons, respectively. To this aim, we propose a deep learning-based approach for temporal 3D pose recognition problems based on a combination of a Convolutional Neural Network (CNN) and a Long Short-Term Memory (LSTM) recurrent network. We also present a two-stage training strategy which firstly focuses on CNN training and, secondly, adjusts the full method (CNN+LSTM). Experimental testing demonstrated that our training method obtains better results than a single-stage training strategy. Additionally, we propose a data augmentation method that has also been validated experimentally. Finally, we perform an extensive experimental study on publicly available data benchmarks. The results obtained show how the proposed approach reaches state-of-the-art performance whenHighlights: Combination of a Convolutional Neural Network (CNN) and a Long Short-Term Memory (LSTM) recurrent network for skeleton-based human activity and hand gesture recognition. Two-stage training strategy which firstly focuses on the CNN training and, secondly, adjusts the full method CNN+LSTM. A method for data augmentation in the context of spatiotemporal 3D data sequences. An exhaustive experimental study on publicly available data benchmarks with respect to the state-of-the-art most representative methods. Comparison among different CPU and GPU platforms. Abstract: In this work, we address human activity and hand gesture recognition problems using 3D data sequences obtained from full-body and hand skeletons, respectively. To this aim, we propose a deep learning-based approach for temporal 3D pose recognition problems based on a combination of a Convolutional Neural Network (CNN) and a Long Short-Term Memory (LSTM) recurrent network. We also present a two-stage training strategy which firstly focuses on CNN training and, secondly, adjusts the full method (CNN+LSTM). Experimental testing demonstrated that our training method obtains better results than a single-stage training strategy. Additionally, we propose a data augmentation method that has also been validated experimentally. Finally, we perform an extensive experimental study on publicly available data benchmarks. The results obtained show how the proposed approach reaches state-of-the-art performance when compared to the methods identified in the literature. The best results were obtained for small datasets, where the proposed data augmentation strategy has greater impact. … (more)
- Is Part Of:
- Pattern recognition. Volume 76(2018:Apr.)
- Journal:
- Pattern recognition
- Issue:
- Volume 76(2018:Apr.)
- Issue Display:
- Volume 76 (2018)
- Year:
- 2018
- Volume:
- 76
- Issue Sort Value:
- 2018-0076-0000-0000
- Page Start:
- 80
- Page End:
- 94
- Publication Date:
- 2018-04
- Subjects:
- Deep learning -- Convolutional Neural Network -- Recurrent neural network -- Long Short-Term Memory -- Human activity recognition -- Hand gesture recognition -- Real-time
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2017.10.033 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 11338.xml