End-to-end DNN based text-independent speaker recognition for long and short utterances. (January 2020)

Record Type:: Journal Article
Title:: End-to-end DNN based text-independent speaker recognition for long and short utterances. (January 2020)
Main Title:: End-to-end DNN based text-independent speaker recognition for long and short utterances
Authors:: Rohdin, Johan
Silnova, Anna
Diez, Mireia
Plchot, Oldřich
Matějka, Pavel
Burget, Lukáš
Glembek, Ondřej
Abstract:: Abstract: Recently several end-to-end speaker verification systems based on deep neural networks (DNNs) have been proposed. These systems have been proven to be competitive for text-dependent tasks as well as for text-independent tasks with short utterances. However, for text-independent tasks with longer utterances, end-to-end systems are still outperformed by standard i-vector + PLDA systems. In this work, we present an end-to-end speaker verification system that is initialized to mimic an i-vector + PLDA baseline. The system is then further trained in an end-to-end manner but regularized so that it does not deviate too far from the initial system. In this way we mitigate overfitting which normally limits the performance of end-to-end systems. The proposed system outperforms the i-vector + PLDA baseline on both long and short duration utterances.
Is Part Of:: Computer speech & language. Volume 59(2020)
Journal:: Computer speech & language
Issue:: Volume 59(2020)
Issue Display:: Volume 59, Issue 2020 (2020)
Year:: 2020
Volume:: 59
Issue:: 2020
Issue Sort Value:: 2020-0059-2020-0000
Page Start:: 22
Page End:: 35
Publication Date:: 2020-01
Subjects:: Speaker verification -- DNN -- End-to-end -- Text-independent -- i-vector -- PLDA
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
Journal URLs:: http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.csl.2019.06.002 ↗
Languages:: English
ISSNs:: 0885-2308
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 11888.xml