Robot Hearing Through Optical Channel in a Cocktail Party Environment. (9th December 2022)
- Record Type:
- Journal Article
- Title:
- Robot Hearing Through Optical Channel in a Cocktail Party Environment. (9th December 2022)
- Main Title:
- Robot Hearing Through Optical Channel in a Cocktail Party Environment
- Authors:
- Guo, Xiao
Ding, Siyi
Peng, Ti
Li, Kenan
Hong, Xiaoping - Abstract:
- Abstract : The cocktail party problem refers to a challenging process when the human sensory system tries to separate a specific voice from a loud mixture of background sound sources. The problem is much more demanding for machines and has become the holy grail in robotic hearing. Despite many advances in noise suppression, the intrinsic information from the contaminated acoustic channel remains difficult to recover. Herein, a simple‐yet‐powerful laser‐assisted audio system termed robot ear accomplished by laser (REAL) is shown to probe the vibrations of sound‐carrying surfaces (mask, throat, and other nearby surfaces) in optical channel, which is intrinsically immune to acoustic background noises. The results demonstrate that REAL can directly obtain the audio‐frequency content from the laser without acoustic channel interference. The signals can be further transcribed into human‐recognizable audios by exploiting the internal time and frequency correlations through memory‐enabled neural networks. The REAL system would enable a new way in human–robot interaction. An interactive preprint version of the article can be found at: https://www.authorea.com/doi/full/10.1002/aisy.202200143 . Abstract : The cocktail party problem, hearing a specific speaker in a loud noisy environment, is extremely difficult for machines and has become the holy grail in audio processing research. A robotic ear termed robot ear accomplished by laser is devised to accurately recognize human voices inAbstract : The cocktail party problem refers to a challenging process when the human sensory system tries to separate a specific voice from a loud mixture of background sound sources. The problem is much more demanding for machines and has become the holy grail in robotic hearing. Despite many advances in noise suppression, the intrinsic information from the contaminated acoustic channel remains difficult to recover. Herein, a simple‐yet‐powerful laser‐assisted audio system termed robot ear accomplished by laser (REAL) is shown to probe the vibrations of sound‐carrying surfaces (mask, throat, and other nearby surfaces) in optical channel, which is intrinsically immune to acoustic background noises. The results demonstrate that REAL can directly obtain the audio‐frequency content from the laser without acoustic channel interference. The signals can be further transcribed into human‐recognizable audios by exploiting the internal time and frequency correlations through memory‐enabled neural networks. The REAL system would enable a new way in human–robot interaction. An interactive preprint version of the article can be found at: https://www.authorea.com/doi/full/10.1002/aisy.202200143 . Abstract : The cocktail party problem, hearing a specific speaker in a loud noisy environment, is extremely difficult for machines and has become the holy grail in audio processing research. A robotic ear termed robot ear accomplished by laser is devised to accurately recognize human voices in cocktail party environments by remotely probing the vibrations of sound‐carrying surfaces by optical means. … (more)
- Is Part Of:
- Advanced intelligent systems. Volume 5:Number 1(2023)
- Journal:
- Advanced intelligent systems
- Issue:
- Volume 5:Number 1(2023)
- Issue Display:
- Volume 5, Issue 1 (2023)
- Year:
- 2023
- Volume:
- 5
- Issue:
- 1
- Issue Sort Value:
- 2023-0005-0001-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2022-12-09
- Subjects:
- cocktail party problem -- human–robot interaction -- optical audio collection -- robot hearing
Artificial intelligence -- Periodicals
Robotics -- Periodicals
Control theory -- Periodicals
006.3 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
https://onlinelibrary.wiley.com/journal/26404567 ↗ - DOI:
- 10.1002/aisy.202200143 ↗
- Languages:
- English
- ISSNs:
- 2640-4567
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 25157.xml