3D Convolutional Neural Networks for Dynamic Sign Language Recognition. (14th May 2018)
- Record Type:
- Journal Article
- Title:
- 3D Convolutional Neural Networks for Dynamic Sign Language Recognition. (14th May 2018)
- Main Title:
- 3D Convolutional Neural Networks for Dynamic Sign Language Recognition
- Authors:
- Liang, Zhi-jie
Liao, Sheng-bin
Hu, Bing-zhang - Editors:
- Manolopoulos, Yannis
- Abstract:
- Abstract: Automatic dynamic sign language recognition is even more challenging than gesture recognition due to the fact that the vocabularies are large and signs are context dependent. Previous works in this direction tend to build classifiers based on complex hand-crafted features computed from the raw inputs. As a type of deep learning model, convolutional neural networks (CNNs) have significantly advanced the accuracy of human gesture classification. However, such methods are currently used to treat video frames as 2D images and recognize gestures at the individual frame level. In this paper, we present a data driven system in which 3D-CNNs are applied to extract spatial and temporal features from video streams, and the motion information is captured by noting the variation in depth between each pair of consecutive frames. To further boost the performance, multi-modal of video streams, including infrared, contour and skeleton are used as input for the architecture and the prediction results estimated from the different sub-networks were fused together. In order to validate our method, we introduce a new challenging multi-modal dynamic sign language dataset captured with Kinect sensors. We evaluate the proposed approach on the collected dataset and achieve superior performance. Moreover, our method achieves a mean Jaccard Index score of 0.836 on the ChaLearn Looking at People Gesture datasets.
- Is Part Of:
- Computer journal. Volume 61:Number 11(2018)
- Journal:
- Computer journal
- Issue:
- Volume 61:Number 11(2018)
- Issue Display:
- Volume 61, Issue 11 (2018)
- Year:
- 2018
- Volume:
- 61
- Issue:
- 11
- Issue Sort Value:
- 2018-0061-0011-0000
- Page Start:
- 1724
- Page End:
- 1736
- Publication Date:
- 2018-05-14
- Subjects:
- Deep learning -- 3D convolutional neural networks -- model combination -- sign language recognition
Computers -- Periodicals
005.1 - Journal URLs:
- http://comjnl.oxfordjournals.org/ ↗
http://ukcatalogue.oup.com/ ↗ - DOI:
- 10.1093/comjnl/bxy049 ↗
- Languages:
- English
- ISSNs:
- 0010-4620
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.060000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12178.xml