Speaker verification based on the fusion of speech acoustics and inverted articulatory signals. (March 2016)

Record Type:: Journal Article
Title:: Speaker verification based on the fusion of speech acoustics and inverted articulatory signals. (March 2016)
Main Title:: Speaker verification based on the fusion of speech acoustics and inverted articulatory signals
Authors:: Li, Ming
Kim, Jangwon
Lammert, Adam
Ghosh, Prasanta Kumar
Ramanarayanan, Vikram
Narayanan, Shrikanth
Abstract:: Highlights: A practical feature-level and score-level fusion approach for speaker verification with articulatory information. Concatenating real articulatory measurements with MFCCs improves the performance. Concatenating acoustic-to-articulatory inversion features with MFCCs also improves the result. Abstract: We propose a practical feature-level and score-level fusion approach by combining acoustic and estimated articulatory information for both text-independent and text-dependent speaker verification. From a practical point of view, we study how to improve speaker verification performance by combining dynamic articulatory information with the conventional acoustic features. On text-independent speaker verification, we find that concatenating articulatory features obtained from measured speech production data with conventional Mel-frequency cepstral coefficients (MFCCs) improves the performance dramatically. However, since directly measuring articulatory data is not feasible in many real-world applications, we also experiment with estimated articulatory features obtained through acoustic-to-articulatory inversion. We explore both feature-level and score-level fusion methods and find that the overall system performance is significantly enhanced even with estimated articulatory features. Such a performance boost could be due to the inter-speaker variation information embedded in the estimated articulatory features. Since the dynamics of articulation contain …
Is Part Of:: Computer speech & language. Volume 36(2016)
Journal:: Computer speech & language
Issue:: Volume 36(2016)
Issue Display:: Volume 36, Issue 2016 (2016)
Year:: 2016
Volume:: 36
Issue:: 2016
Issue Sort Value:: 2016-0036-2016-0000
Page Start:: 196
Page End:: 211
Publication Date:: 2016-03
Subjects:: Text independent speaker verification -- Text dependent speaker verification -- Speech production -- Articulatory features -- Acoustic-to-articulatory inversion
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
Journal URLs:: http://www.journals.elsevier.com/computer-speech-and-language/
http://www.elsevier.com/journals
DOI:: 10.1016/j.csl.2015.05.003
Languages:: English
ISSNs:: 0885-2308
Deposit Type:: Legal deposit
View Content:: Available online (eLD content is only available in our Reading Rooms)
Physical Locations:: British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 528.xml