Head‐related transfer function–reserved time‐frequency masking for robust binaural sound source localization. Issue 1 (2nd March 2021)
- Record Type:
- Journal Article
- Title:
- Head‐related transfer function–reserved time‐frequency masking for robust binaural sound source localization. Issue 1 (2nd March 2021)
- Main Title:
- Head‐related transfer function–reserved time‐frequency masking for robust binaural sound source localization
- Authors:
- Liu, Hong
Yuan, Peipei
Yang, Bing
Yang, Ge
Chen, Yang
- Abstract:
- Various time‐frequency (T‐F) masks are being applied to sound source localization tasks. Moreover, deep learning has dramatically advanced T‐F mask estimation. However, existing masks are usually designed for speech separation tasks and are suitable only for single‐channel signals. A novel complex‐valued T‐F mask is proposed that reserves the head‐related transfer function (HRTF), customized for binaural sound source localization. In addition, because the convolutional neural network that is exploited to estimate the proposed mask takes binaural spectral information as the input and output, accurate binaural cues can be preserved. Compared with conventional T‐F masks that emphasize single speech source–dominated T‐F units, HRTF‐reserved masks eliminate the speech component while keeping the direct propagation path. Thus, the estimated HRTF is capable of extracting more reliable localization features for the final direction of arrival estimation. Hence, binaural sound source localization guided by the proposed T‐F mask is robust under noisy and reverberant acoustic environments. The experimental results demonstrate that the new T‐F mask is superior to conventional T‐F masks and leads to better sound source localization performance in adverse environments.
- Is Part Of:
- CAAI transactions on intelligence technology. Volume 7:Issue 1(2022)
- Journal:
- CAAI transactions on intelligence technology
- Issue:
- Volume 7:Issue 1(2022)
- Issue Display:
- Volume 7, Issue 1 (2022)
- Year:
- 2022
- Volume:
- 7
- Issue:
- 1
- Issue Sort Value:
- 2022-0007-0001-0000
- Page Start:
- 26
- Page End:
- 33
- Publication Date:
- 2021-03-02
- Subjects:
- speech processing -- reverberation -- acoustic signal processing -- transfer functions -- deep learning (artificial intelligence) -- convolutional neural nets
Artificial intelligence -- Periodicals
Computer science -- Periodicals
Artificial intelligence
Computer science
Electronic journals
Periodicals
006.305
- Journal URLs:
- https://digital-library.theiet.org/content/journals/trit
https://ietresearch.onlinelibrary.wiley.com/journal/24682322
http://search.ebscohost.com/login.aspx?direct=true&site=edspub-live&scope=site&type=44&db=edspub&authtype=ip, guest&custid=ns011247&groupid=main&profile=eds&bquery=AN%2010129651
http://www.sciencedirect.com/
- DOI:
- 10.1049/cit2.12010
- Languages:
- English
- ISSNs:
- 2468-6557
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 2943.720000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 26241.xml