Kinect microphone array-based speech and speaker recognition for the exhibition control of humanoid robots. (August 2017)
- Record Type:
- Journal Article
- Title:
- Kinect microphone array-based speech and speaker recognition for the exhibition control of humanoid robots. (August 2017)
- Main Title:
- Kinect microphone array-based speech and speaker recognition for the exhibition control of humanoid robots
- Authors:
- Ding, Ing-Jr
Shi, Jia-Yi - Abstract:
- Highlights: Kinect microphone array-based voice control for operating a robot is proposed. Speech and speaker recognition are effectively combined for fine robot control. Kinect fuzzy-DTW with an accurately designed fuzzy controller is proposed. Abstract: This study developed a Kinect microphone array-based method for the voice-based control of humanoid robot exhibitions through speech and speaker recognition. A support vector machine (SVM), a Gaussian mixture model (GMM), and dynamic time warping (DTW) were used for speaker verification, speaker identification, and speech recognition, respectively; they were effectively combined for realizing advanced voice-based control of humanoid robot exhibitions. Speech recognition capability was enhanced by using the Kinect microphone array and by combining the DTW-based recognition decisions associated with all the microphones through a fuzzy control scheme. A humanoid robot with the proposed voice-based control can be controlled through voice commands by authenticated users. The robot first verifies the authenticity of the personal operator, following which it identifies the operator and validates the command. Subsequently, it executes the command if both the user and command are valid. Experimental results demonstrated the effectiveness and accuracy of the proposed method. Graphical abstract:
- Is Part Of:
- Computers & electrical engineering. Volume 62(2017)
- Journal:
- Computers & electrical engineering
- Issue:
- Volume 62(2017)
- Issue Display:
- Volume 62, Issue 2017 (2017)
- Year:
- 2017
- Volume:
- 62
- Issue:
- 2017
- Issue Sort Value:
- 2017-0062-2017-0000
- Page Start:
- 719
- Page End:
- 729
- Publication Date:
- 2017-08
- Subjects:
- Kinect microphone array -- Speaker recognition -- Speech recognition -- Humanoid robot -- Kinect-SSA -- Kinect fuzzy–DTW
Computer engineering -- Periodicals
Electrical engineering -- Periodicals
Electrical engineering -- Data processing -- Periodicals
Ordinateurs -- Conception et construction -- Périodiques
Électrotechnique -- Périodiques
Électrotechnique -- Informatique -- Périodiques
Computer engineering
Electrical engineering
Electrical engineering -- Data processing
Periodicals
Electronic journals
621.302854 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00457906/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compeleceng.2015.12.010 ↗
- Languages:
- English
- ISSNs:
- 0045-7906
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.680000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 4714.xml