Sparse coding over redundant dictionaries for fast adaptation of speech recognition system. (May 2017)

Record Type:: Journal Article
Title:: Sparse coding over redundant dictionaries for fast adaptation of speech recognition system. (May 2017)
Main Title:: Sparse coding over redundant dictionaries for fast adaptation of speech recognition system
Authors:: Shahnawazuddin, S.
Sinha, Rohit
Abstract:: Highlights: A novel use of sparse coding is done for the on-line adaptation of HMM based ASR systems. The target is first sparse coded using OMP over exemplar/learned speaker dictionaries. Adapted model is obtained by the maximum likelihood scaling of the sparse coded target. Performance same as the existing techniques is obtained but with much lower complexity. Abstract: This work presents a novel use of the sparse coding over redundant dictionary for fast adaptation of the acoustic models in the hidden Markov model-based automatic speech recognition (ASR) systems. The presented work is an extension of the existing acoustic model-interpolation-based fast adaptation approaches. In these methods, the basis (model) weights are estimated using an iterative procedure employing the maximum-likelihood (ML) criterion. For effective adaptation, typically a number of bases are selected and as a result of that the latency of the iterative weight estimation process becomes high for those ASR tasks that involve human-machine interactions. To address this issue, we propose the use of sparse coding of the target mean supervector over a speaker-specific (exemplar) redundant dictionary. In this approach, the employed greedy sparse coding not only selects the desired bases but also compresses them into a single supervector, which is then ML scaled to yield the adapted mean parameters. Thus reducing the latency in the basis weight estimation in comparison to the existing fast adaptation … (more)
Is Part Of:: Computer speech & language. Volume 43(2017)
Journal:: Computer speech & language
Issue:: Volume 43(2017)
Issue Display:: Volume 43, Issue 2017 (2017)
Year:: 2017
Volume:: 43
Issue:: 2017
Issue Sort Value:: 2017-0043-2017-0000
Page Start:: 1
Page End:: 17
Publication Date:: 2017-05
Subjects:: Fast adaptation -- Acoustic model interpolation -- Sparse coding -- Exemplar and learned speaker dictionary
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
Journal URLs:: http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.csl.2016.10.004 ↗
Languages:: English
ISSNs:: 0885-2308
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 276.xml