Neural candidate-aware language models for speech recognition. (March 2021)

Record Type:: Journal Article
Title:: Neural candidate-aware language models for speech recognition. (March 2021)
Main Title:: Neural candidate-aware language models for speech recognition
Authors:: Tanaka, Tomohiro
Masumura, Ryo
Oba, Takanobu
Abstract:: Abstract: This paper presents novel neural network based language models that can correct automatic speech recognition (ASR) errors by using speech recognizer outputs as a context. Our proposed models, called neural candidate-aware language models (NCALMs), estimate the generative probability of a target sentence while considering ASR outputs including hypotheses and their posterior probabilities. Recently, neural network language models have achieved great success in ASR field because of their ability to learn long-range contexts and model the word representation in continuous space. However, they estimate a sentence probability without considering other candidates and their posterior probabilities, even though the competing hypotheses are available and include important information to increase the speech recognition accuracy. To overcome this limitation, our idea is to utilize ASR outputs in both the training phase and the inference phase. Our proposed models are conditional generative models consisting of a Transformer encoder and a Transformer decoder. The encoder embeds the candidates as context vectors and the decoder estimates a sentence probability given the context vectors. We evaluate the proposed models in Japanese lecture transcription and English conversational speech recognition tasks. Experimental results show that a NCALM has better ASR performance than a system including a deep neural network-hidden Markov model hybrid system. We further improve ASR … (more)
Is Part Of:: Computer speech & language. Volume 66(2021)
Journal:: Computer speech & language
Issue:: Volume 66(2021)
Issue Display:: Volume 66, Issue 2021 (2021)
Year:: 2021
Volume:: 66
Issue:: 2021
Issue Sort Value:: 2021-0066-2021-0000
Page Start:
Page End:
Publication Date:: 2021-03
Subjects:: Automatic speech recognition -- Neural network language models -- Rescoring hypotheses -- Conditional generative language models
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
Journal URLs:: http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.csl.2020.101157 ↗
Languages:: English
ISSNs:: 0885-2308
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 15413.xml