Robot Audition and Computational Auditory Scene Analysis. (8th July 2020)
- Record Type:
- Journal Article
- Title:
- Robot Audition and Computational Auditory Scene Analysis. (8th July 2020)
- Main Title:
- Robot Audition and Computational Auditory Scene Analysis
- Authors:
- Nakadai, Kazuhiro
Okuno, Hiroshi G. - Abstract:
- Abstract : Robot audition aims at developing robot's ears that work in the real world, that is, machine listening of multiple sound sources. Its critical problem is noise. Speech interfaces have become more familiar and more indispensable as smartphones and artificial intelligence (AI) speakers spread. Their critical problems are noise and multiple simultaneous speakers. Recently two technological advances have contributed to significantly improve the performance of speech interfaces and robot audition. Emerging deep learning technology has improved noise robustness of automatic speech recognition, whereas microphone array processing has improved the performance of preprocessing such as noise reduction. Herein, an overview and history of robot audition are provided together with introduction of an open‐source software for robot audition and its wide applications in the real world. Also, it is discussed how robot audition contributes to the development of computational auditory scene analysis, that is, understanding of real‐world auditory environments. Abstract : Herein, an overview and history of robot audition are presented, and an open‐source software for robot audition is introduced together with its wide applications in the real world. It is also discussed how robot audition contributes to the development of computational auditory scene analysis, that is, understanding of real‐world auditory environments.
- Is Part Of:
- Advanced intelligent systems. Volume 2:Number 9(2020)
- Journal:
- Advanced intelligent systems
- Issue:
- Volume 2:Number 9(2020)
- Issue Display:
- Volume 2, Issue 9 (2020)
- Year:
- 2020
- Volume:
- 2
- Issue:
- 9
- Issue Sort Value:
- 2020-0002-0009-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2020-07-08
- Subjects:
- automatic speech recognition -- multimodal integration -- open-source softwares -- robot audition -- sound-source localization -- sound source separation
Artificial intelligence -- Periodicals
Robotics -- Periodicals
Control theory -- Periodicals
006.3 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
https://onlinelibrary.wiley.com/journal/26404567 ↗ - DOI:
- 10.1002/aisy.202000050 ↗
- Languages:
- English
- ISSNs:
- 2640-4567
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 14708.xml