A fuzzy‐clustering‐based hierarchical i‐vector/probabilistic linear discriminant analysis system for text‐dependent speaker verification. Issue 3 (30th January 2020)

Record Type:: Journal Article
Title:: A fuzzy‐clustering‐based hierarchical i‐vector/probabilistic linear discriminant analysis system for text‐dependent speaker verification. Issue 3 (30th January 2020)
Main Title:: A fuzzy‐clustering‐based hierarchical i‐vector/probabilistic linear discriminant analysis system for text‐dependent speaker verification
Authors:: Laskar, Mohammad Azharuddin
Laskar, Rabul Hussain
Other Names:: Saen, Reza Farzipoor (guest editor)
Song, Malin (guest editor)
Fisher, Ron (guest editor)
Abstract:: In the i‐vector/probabilistic linear discriminant analysis (PLDA) technique, the PLDA backend classifier is modelled on i‐vectors. PLDA defines an i‐vector subspace that compensates for unwanted variability and helps to discriminate among speaker‐phrase pairs. The channel or session variability manifested in i‐vectors is known to be nonlinear in nature. PLDA training, however, assumes the variability to be linearly separable, thereby causing loss of important discriminating information. Besides, the i‐vector estimation itself is known to be poor in the case of short utterances. This paper attempts to address these issues using a simple hierarchy‐based system. A modified fuzzy‐clustering technique is employed to divide the feature space into more characteristic feature subspaces using vocal source features. Thereafter, a separate i‐vector/PLDA model is trained for each of the subspaces. The sparser alignment owing to the subspace‐specific universal background model and the relatively reduced dimensions of variability in individual subspaces help to train more effective i‐vector/PLDA models. Also, vocal source features are complementary to mel frequency cepstral coefficients, which are transformed into i‐vectors using a mixture model technique. As a consequence, vocal source features and i‐vectors tend to have complementary information. Thus, using vocal source features for classification in a hierarchy tree may help to differentiate some of the speaker‐phrase classes, which …
Is Part Of:: Expert systems. Volume 37:Issue 3(2020)
Journal:: Expert systems
Issue:: Volume 37:Issue 3(2020)
Issue Display:: Volume 37, Issue 3 (2020)
Year:: 2020
Volume:: 37
Issue:: 3
Issue Sort Value:: 2020-0037-0003-0000
Page Start:: n/a
Page End:: n/a
Publication Date:: 2020-01-30
Subjects:: fuzzy clustering -- hierarchy system -- i‐vector -- text‐dependent speaker verification
Expert systems (Computer science)
006.33
Journal URLs:: http://onlinelibrary.wiley.com/journal/10.1111/(ISSN)1468-0394
http://onlinelibrary.wiley.com/
DOI:: 10.1111/exsy.12496
Languages:: English
ISSNs:: 0266-4720
Deposit Type:: Legal deposit
View Content:: Available online (eLD content is only available in our Reading Rooms)
Physical Locations:: British Library DSC - 3842.004000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
Ingest File:: 13175.xml