Predicting speech intelligibility with deep neural networks. (March 2018)

Record Type:: Journal Article
Title:: Predicting speech intelligibility with deep neural networks. (March 2018)
Main Title:: Predicting speech intelligibility with deep neural networks
Authors:: Spille, Constantin
Ewert, Stephan D.
Kollmeier, Birger
Meyer, Bernd T.
Abstract:: Highlights: An automatic speech recognizer using deep neural networks is proposed as model to predict speech intelligibility (SI). The DNN-based model predicts SI in normal-hearing listeners more accurately than four established SI models. In contrast to baseline models, the proposed model predicts intelligibility from the noisy speech signal and does not require separated noise and speech input. A relevance propagation algorithm shows that DNNs can listen in the dips in modulated maskers. Graphical abstract: Abstract: An accurate objective prediction of human speech intelligibility is of interest for many applications such as the evaluation of signal processing algorithms. To predict the speech recognition threshold (SRT) of normal-hearing listeners, an automatic speech recognition (ASR) system is employed that uses a deep neural network (DNN) to convert the acoustic input into phoneme predictions, which are subsequently decoded into word transcripts. ASR results are obtained with and compared to data presented in Schubotz et al. (2016), which comprises eight different additive maskers that range from speech-shaped stationary noise to a single-talker interferer and responses from eight normal-hearing subjects. The task for listeners and ASR is to identify noisy words from a German matrix sentence test in monaural conditions. Two ASR training schemes typically used in applications are considered: (A) matched training, which uses the same noise type for training and testing … (more)
Is Part Of:: Computer speech & language. Volume 48(2018)
Journal:: Computer speech & language
Issue:: Volume 48(2018)
Issue Display:: Volume 48, Issue 2018 (2018)
Year:: 2018
Volume:: 48
Issue:: 2018
Issue Sort Value:: 2018-0048-2018-0000
Page Start:: 51
Page End:: 66
Publication Date:: 2018-03
Subjects:: Speech intelligibility prediction -- Deep neural networks -- Automatic speech recognition
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
Journal URLs:: http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.csl.2017.10.004 ↗
Languages:: English
ISSNs:: 0885-2308
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 5383.xml