Robust speaker localization for real-world robots. (November 2015)
- Record Type:
- Journal Article
- Title:
- Robust speaker localization for real-world robots. (November 2015)
- Main Title:
- Robust speaker localization for real-world robots
- Authors:
- Athanasopoulos, Georgios
Verhelst, Werner
Sahli, Hichem - Abstract:
- Abstract : Highlights: We look at the acoustic localization for real-world robots and conditions. We investigate the role of the microphones characteristics in the acoustic localization. We propose a time delay estimation smoothing which is based on the signal properties. We formulate a framework for addressing the robot's shape and surface material influence in the acoustic localization. We propose a pre-processing approach for enhancing the acoustic localization performance. Abstract: Autonomous human–robot interaction ultimately requires an artificial audition module that allows the robot to process and interpret a combination of verbal and non-verbal auditory inputs. A key component of such a module is the acoustic localization. The acoustic localization not only enables the robot to simultaneously localize multiple persons and auditory events of interest in the environment, but also provides input to auditory tasks such as speech enhancement and speech recognition. The use of microphone arrays in robots is an efficient and commonly applied approach to the localization problem. In this paper, moving away from simulated environments, we look at the acoustic localization under real-world conditions and limitations. Our approach proposes a series of enhancements, taking into account the imperfect frequency response of the array microphones and addressing the influence of the robot's shape and surface material. Motivated by the importance of the signal's phase information,Abstract : Highlights: We look at the acoustic localization for real-world robots and conditions. We investigate the role of the microphones characteristics in the acoustic localization. We propose a time delay estimation smoothing which is based on the signal properties. We formulate a framework for addressing the robot's shape and surface material influence in the acoustic localization. We propose a pre-processing approach for enhancing the acoustic localization performance. Abstract: Autonomous human–robot interaction ultimately requires an artificial audition module that allows the robot to process and interpret a combination of verbal and non-verbal auditory inputs. A key component of such a module is the acoustic localization. The acoustic localization not only enables the robot to simultaneously localize multiple persons and auditory events of interest in the environment, but also provides input to auditory tasks such as speech enhancement and speech recognition. The use of microphone arrays in robots is an efficient and commonly applied approach to the localization problem. In this paper, moving away from simulated environments, we look at the acoustic localization under real-world conditions and limitations. Our approach proposes a series of enhancements, taking into account the imperfect frequency response of the array microphones and addressing the influence of the robot's shape and surface material. Motivated by the importance of the signal's phase information, we introduce a novel pre-processing step for enhancing the acoustic localization. Results show that the proposed approach improves the localization performance in joint noisy and reverberant conditions and allows a humanoid robot to locate multiple speakers in a real-world environment. … (more)
- Is Part Of:
- Computer speech & language. Volume 34(2015)
- Journal:
- Computer speech & language
- Issue:
- Volume 34(2015)
- Issue Display:
- Volume 34, Issue 2015 (2015)
- Year:
- 2015
- Volume:
- 34
- Issue:
- 2015
- Issue Sort Value:
- 2015-0034-2015-0000
- Page Start:
- 129
- Page End:
- 153
- Publication Date:
- 2015-11
- Subjects:
- Microphone arrays -- Acoustic localization -- Time delay estimation -- Steered response power -- Phase spectrum enhancement
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2015.03.009 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 9699.xml