Unsupervised language model adaptation using LDA-based mixture models and latent semantic marginals. (January 2015)

Record Type:: Journal Article
Title:: Unsupervised language model adaptation using LDA-based mixture models and latent semantic marginals. (January 2015)
Main Title:: Unsupervised language model adaptation using LDA-based mixture models and latent semantic marginals
Authors:: Haidar, Md. Akmal
O'Shaughnessy, Douglas
Abstract:: Abstract : Highlights: Unsupervised language model adaptation approaches for speech recognition. A hard-clustering approach is applied into latent Dirichlet allocation (LDA) model. Tri-gram topic models are created and mixed to form an adapted tri-gram model. The adapted tri-gram model is interpolated with a background tri-gram model. The above models are modified using unigram latent semantic marginals. Abstract: In this paper, we present unsupervised language model (LM) adaptation approaches using latent Dirichlet allocation (LDA) and latent semantic marginals (LSM). The LSM is the unigram probability distribution over words that are calculated using LDA-adapted unigram models. The LDA model is used to extract topic information from a training corpus in an unsupervised manner. The LDA model yields a document–topic matrix that describes the number of words assigned to topics for the documents. A hard-clustering method is applied on the document–topic matrix of the LDA model to form topics. An adapted model is created by using a weighted combination of the n -gram topic models. The stand-alone adapted model outperforms the background model. The interpolation of the background model and the adapted model gives further improvement. We modify the above models using the LSM. The LSM is used to form a new adapted model by using the minimum discriminant information (MDI) adaptation approach called unigram scaling, which minimizes the distance between the new adapted model and the … (more)
Is Part Of:: Computer speech & language. Volume 29(2015)
Journal:: Computer speech & language
Issue:: Volume 29(2015)
Issue Display:: Volume 29, Issue 2015 (2015)
Year:: 2015
Volume:: 29
Issue:: 2015
Issue Sort Value:: 2015-0029-2015-0000
Page Start:: 20
Page End:: 31
Publication Date:: 2015-01
Subjects:: Language model -- Topic model -- Mixture model -- Speech recognition -- Minimum discriminant information
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
Journal URLs:: http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.csl.2014.06.002 ↗
Languages:: English
ISSNs:: 0885-2308
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 5426.xml