On the use of deep feedforward neural networks for automatic language identification. (November 2016)

Record Type:: Journal Article
Title:: On the use of deep feedforward neural networks for automatic language identification. (November 2016)
Main Title:: On the use of deep feedforward neural networks for automatic language identification
Authors:: Lopez-Moreno, Ignacio
Gonzalez-Dominguez, Javier
Martinez, David
Plchot, Oldřich
Gonzalez-Rodriguez, Joaquin
Moreno, Pedro J.
Abstract:: Highlights: This work presents a comprehensive study on the use of deep neural networks for automatic language identification. It includes a detailed performance analysis for different data selection strategies and DNN architectures. Proposed systems are tested on the NIST Language Recognition Evaluation 2009, against an state-of-the-art i-vector baseline. It also presents a novel approach that combines DNN and i-vector systems by using bottleneck features. The combination of i-vector and bottleneck systems outperforms our baseline system by 45% in EER and Cavg, on 3s and 10s. Abstract: In this work, we present a comprehensive study on the use of deep neural networks (DNNs) for automatic language identification (LID). Motivated by the recent success of using DNNs in acoustic modeling for speech recognition, we adapt DNNs to the problem of identifying the language in a given utterance from its short-term acoustic features. We propose two different DNN-based approaches. In the first one, the DNN acts as an end-to-end LID classifier, receiving as input the speech features and providing as output the estimated probabilities of the target languages. In the second approach, the DNN is used to extract bottleneck features that are then used as inputs for a state-of-the-art i-vector system. Experiments are conducted in two different scenarios: the complete NIST Language Recognition Evaluation dataset 2009 (LRE'09) and a subset of the Voice of America (VOA) data from LRE'09, in which … (more)
Is Part Of:: Computer speech & language. Volume 40(2016)
Journal:: Computer speech & language
Issue:: Volume 40(2016)
Issue Display:: Volume 40, Issue 2016 (2016)
Year:: 2016
Volume:: 40
Issue:: 2016
Issue Sort Value:: 2016-0040-2016-0000
Page Start:: 46
Page End:: 59
Publication Date:: 2016-11
Subjects:: LID -- DNN -- Bottleneck -- i-vectors
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
Journal URLs:: http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.csl.2016.03.001 ↗
Languages:: English
ISSNs:: 0885-2308
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 1262.xml