Unsupervised rapid speaker adaptation based on selective eigenvoice merging for user-specific voice interaction. (April 2015)

Record Type:: Journal Article
Title:: Unsupervised rapid speaker adaptation based on selective eigenvoice merging for user-specific voice interaction. (April 2015)
Main Title:: Unsupervised rapid speaker adaptation based on selective eigenvoice merging for user-specific voice interaction
Authors:: Choi, Dong-Jin
Park, Jeong-Sik
Oh, Yung-Hwan
Abstract:: Speaker adaptation transforms the standard speaker-independent acoustic models into an adapted model relevant to the user (called the target speaker) in order to provide reliable speech recognition performance. Although several conventional adaptation techniques, such as Maximum Likelihood Linear Regression (MLLR) and Maximum A Posteriori (MAP), have been successfully applied to speech recognition tasks, they demonstrate great dependency on the amount of adaptation data. However, the eigenvoice-based adaptation technique is known to provide reliable performance regardless of the amount of data, even for a very small amount. In this study, we propose an efficient eigenvoice adaptation approach to construct more reliable adapted models. The proposed approach merges eigenvoice sets for possible eigenvoice combinations, and then selects optimal eigenvoice sets that are most relevant to the target speaker. For this task, we propose an efficient unsupervised eigenvoice selection method as well as a rapid merging technique. On speech recognition experiments using the Defense Advanced Research Projects Agency's Resource Management corpus, the proposed approach exhibited superior performance, compared to conventional methods, in both recognition accuracy and time complexity.
Is Part Of:: Engineering applications of artificial intelligence. Volume 40(2015:Apr.)
Journal:: Engineering applications of artificial intelligence
Issue:: Volume 40(2015:Apr.)
Issue Display:: Volume 40 (2015)
Year:: 2015
Volume:: 40
Issue Sort Value:: 2015-0040-0000-0000
Page Start:: 95
Page End:: 102
Publication Date:: 2015-04
Subjects:: Speaker adaptation -- Eigenvoice -- Maximum Likelihood Linear Regression -- Maximum A Posteriori -- Selective eigenvoice merging -- Speech recognition
Engineering -- Data processing -- Periodicals
Artificial intelligence -- Periodicals
Expert systems (Computer science) -- Periodicals
Ingénierie -- Informatique -- Périodiques
Intelligence artificielle -- Périodiques
Systèmes experts (Informatique) -- Périodiques
Artificial intelligence
Engineering -- Data processing
Expert systems (Computer science)
Periodicals
620.00285
Journal URLs:: http://www.sciencedirect.com/science/journal/09521976
http://www.elsevier.com/journals
DOI:: 10.1016/j.engappai.2015.01.010
Languages:: English
ISSNs:: 0952-1976
Deposit Type:: Legal deposit
View Content:: Available online (eLD content is only available in our Reading Rooms)
Physical Locations:: British Library DSC - 3755.704500
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 10040.xml