Gesture recognition algorithm based on multi‐scale feature fusion in RGB‐D images. Issue 4 (23rd December 2022)
- Record Type:
- Journal Article
- Title:
- Gesture recognition algorithm based on multi‐scale feature fusion in RGB‐D images. Issue 4 (23rd December 2022)
- Main Title:
- Gesture recognition algorithm based on multi‐scale feature fusion in RGB‐D images
- Authors:
- Sun, Ying
Weng, Yaoqing
Luo, Bowen
Li, Gongfa
Tao, Bo
Jiang, Du
Chen, Disi - Abstract:
- Abstract: With the rapid development of sensor technology and artificial intelligence, the video gesture recognition technology under the background of big data makes human‐computer interaction more natural and flexible, bringing richer interactive experience to teaching, on‐board control, electronic games, etc. In order to perform robust recognition under the conditions of illumination change, background clutter, rapid movement, partial occlusion, an algorithm based on multi‐level feature fusion of two‐stream convolutional neural network is proposed, which includes three main steps. Firstly, the Kinect sensor obtains RGB‐D images to establish a gesture database. At the same time, data enhancement is performed on training and test sets. Then, a model of multi‐level feature fusion of two‐stream convolutional neural network is established and trained. Experiments result show that the proposed network model can robustly track and recognize gestures, and compared with the single‐channel model, the average detection accuracy is improved by 1.08%, and mean average precision (mAP) is improved by 3.56%. The average recognition rate of gestures under occlusion and different light intensity was 93.98%. Finally, in the ASL dataset, LaRED dataset, and 1‐miohand dataset, recognition accuracy shows satisfactory performances compared to the other method. Abstract : The proposed network model can robustly track and recognize gestures under complex backgrounds, and compared with theAbstract: With the rapid development of sensor technology and artificial intelligence, the video gesture recognition technology under the background of big data makes human‐computer interaction more natural and flexible, bringing richer interactive experience to teaching, on‐board control, electronic games, etc. In order to perform robust recognition under the conditions of illumination change, background clutter, rapid movement, partial occlusion, an algorithm based on multi‐level feature fusion of two‐stream convolutional neural network is proposed, which includes three main steps. Firstly, the Kinect sensor obtains RGB‐D images to establish a gesture database. At the same time, data enhancement is performed on training and test sets. Then, a model of multi‐level feature fusion of two‐stream convolutional neural network is established and trained. Experiments result show that the proposed network model can robustly track and recognize gestures, and compared with the single‐channel model, the average detection accuracy is improved by 1.08%, and mean average precision (mAP) is improved by 3.56%. The average recognition rate of gestures under occlusion and different light intensity was 93.98%. Finally, in the ASL dataset, LaRED dataset, and 1‐miohand dataset, recognition accuracy shows satisfactory performances compared to the other method. Abstract : The proposed network model can robustly track and recognize gestures under complex backgrounds, and compared with the single‐channel model, the average detection accuracy is improved by 1.08%, and mean average precision (mAP) is improved by 3.56%. … (more)
- Is Part Of:
- IET image processing. Volume 17:Issue 4(2023)
- Journal:
- IET image processing
- Issue:
- Volume 17:Issue 4(2023)
- Issue Display:
- Volume 17, Issue 4 (2023)
- Year:
- 2023
- Volume:
- 17
- Issue:
- 4
- Issue Sort Value:
- 2023-0017-0004-0000
- Page Start:
- 1280
- Page End:
- 1290
- Publication Date:
- 2022-12-23
- Subjects:
- image processing -- neural nets
Image processing -- Periodicals
621.36705 - Journal URLs:
- http://digital-library.theiet.org/content/journals/iet-ipr ↗
http://ieeexplore.ieee.org/servlet/opac?punumber=4149689 ↗
http://www.ietdl.org/IET-IPR ↗
https://ietresearch.onlinelibrary.wiley.com/journal/17519667 ↗
http://www.theiet.org/ ↗ - DOI:
- 10.1049/ipr2.12712 ↗
- Languages:
- English
- ISSNs:
- 1751-9659
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4363.252600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 26105.xml