Efficient estimation and model generalization for the totalvariability model. (January 2019)
- Record Type:
- Journal Article
- Title:
- Efficient estimation and model generalization for the totalvariability model. (January 2019)
- Main Title:
- Efficient estimation and model generalization for the totalvariability model
- Authors:
- Travadi, Ruchir
Narayanan, Shrikanth - Abstract:
- Abstract: A number of audio signal processing applications characterize different properties of the source underlying an audio signal by analyzing the distribution of a sequence of feature vectors obtained from the signal. The Total Variability Model has been widely used for this purpose as a mechanism for capturing the variability in the feature vector distribution across different signals within a low dimensional representation. In order to arrive at a compact representation, a number of assumptions are made within the model regarding the properties of this distribution. In this paper, we first present an analysis of a parameter estimation method for the model which offers a computationally efficient alternative to the widely used Expectation Maximization (EM) algorithm, but relies on the validity of the model assumptions, using experiments on speaker and language identification tasks. To explain some of the results obtained using this method, we present an extensive statistical analysis aimed at verifying the validity of some of the model assumptions. We show that many of these model assumptions are not valid for the observed data, and propose model generalizations to replace these assumptions. The proposed generalizations lead to a better performance while also opening up possibilities for discriminative training of the model.
- Is Part Of:
- Computer speech & language. Volume 53(2019)
- Journal:
- Computer speech & language
- Issue:
- Volume 53(2019)
- Issue Display:
- Volume 53, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 53
- Issue:
- 2019
- Issue Sort Value:
- 2019-0053-2019-0000
- Page Start:
- 43
- Page End:
- 64
- Publication Date:
- 2019-01
- Subjects:
- Total variability model -- i-vector -- Speaker identification -- Language identification
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2018.07.003 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 7651.xml