Improving PLDA speaker verification performance using domain mismatch compensation techniques. (January 2018)
- Record Type:
- Journal Article
- Title:
- Improving PLDA speaker verification performance using domain mismatch compensation techniques. (January 2018)
- Main Title:
- Improving PLDA speaker verification performance using domain mismatch compensation techniques
- Authors:
- Rahman, Md Hafizur
Kanagasundaram, Ahilan
Himawan, Ivan
Dean, David
Sridharan, Sridha - Abstract:
- Highlights: Domain mismatch significantly affects the speaker verification performance. Domain invariant linear discriminant analysis (DI-LDA) for compensating domain mismatch in the LDA subspace. Domain invariant probabilistic linear discriminant analysis (DI-PLDA) for domain mismatch modelling n the PLDA subspace. DI-LDA approach followed by the DI-PLDA (DI-PLDA[DI-LDA]) to compensate domain mismatch from both LDA and PLDA subspaces. Limited target domain data requirement using domain mismatch compensation techniques. Abstract: The performance of state-of-the-art i-vector speaker verification systems relies on a large amount of training data for probabilistic linear discriminant analysis (PLDA) modeling. During the evaluation, it is also crucial that the target condition data is matched well with the development data used for PLDA training. However, in many practical scenarios, these systems have to be developed, and trained, using data that is often outside the domain of the intended application, since the collection of a significant amount of in-domain data is often difficult. Experimental studies have found that PLDA speaker verification performance degrades significantly due to this development/evaluation mismatch. This paper introduces a domain-invariant linear discriminant analysis (DI-LDA) technique for out-domain PLDA speaker verification that compensates domain mismatch in the LDA subspace. We also propose a domain-invariant probabilistic linear discriminantHighlights: Domain mismatch significantly affects the speaker verification performance. Domain invariant linear discriminant analysis (DI-LDA) for compensating domain mismatch in the LDA subspace. Domain invariant probabilistic linear discriminant analysis (DI-PLDA) for domain mismatch modelling n the PLDA subspace. DI-LDA approach followed by the DI-PLDA (DI-PLDA[DI-LDA]) to compensate domain mismatch from both LDA and PLDA subspaces. Limited target domain data requirement using domain mismatch compensation techniques. Abstract: The performance of state-of-the-art i-vector speaker verification systems relies on a large amount of training data for probabilistic linear discriminant analysis (PLDA) modeling. During the evaluation, it is also crucial that the target condition data is matched well with the development data used for PLDA training. However, in many practical scenarios, these systems have to be developed, and trained, using data that is often outside the domain of the intended application, since the collection of a significant amount of in-domain data is often difficult. Experimental studies have found that PLDA speaker verification performance degrades significantly due to this development/evaluation mismatch. This paper introduces a domain-invariant linear discriminant analysis (DI-LDA) technique for out-domain PLDA speaker verification that compensates domain mismatch in the LDA subspace. We also propose a domain-invariant probabilistic linear discriminant analysis (DI-PLDA) technique for domain mismatch modeling in the PLDA subspace, using only a small amount of in-domain data. In addition, we propose the sequential and score-level combination of DI-LDA, and DI-PLDA to further improve out-domain speaker verification performance. Experimental results show the proposed domain mismatch compensation techniques yield at least 27% and 14.5% improvement in equal error rate (EER) over a pooled PLDA system for telephone-telephone and interview-interview conditions, respectively. Finally, we show that the improvement over the baseline pooled system can be attained even when significantly reducing the number of in-domain speakers, down to 30 in most of the evaluation conditions. … (more)
- Is Part Of:
- Computer speech & language. Volume 47(2018)
- Journal:
- Computer speech & language
- Issue:
- Volume 47(2018)
- Issue Display:
- Volume 47, Issue 2018 (2018)
- Year:
- 2018
- Volume:
- 47
- Issue:
- 2018
- Issue Sort Value:
- 2018-0047-2018-0000
- Page Start:
- 240
- Page End:
- 258
- Publication Date:
- 2018-01
- Subjects:
- Speaker verification -- I-vector -- Domain mismatch compensation -- DI-LDA -- DI-PLDA -- DI-PLDA[DI-LDA] -- Score fusion
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2017.08.001 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 20786.xml