Joint speaker separation and recognition using non-negative matrix deconvolution with adaptive dictionary. (November 2021)

Record Type:: Journal Article
Title:: Joint speaker separation and recognition using non-negative matrix deconvolution with adaptive dictionary. (November 2021)
Main Title:: Joint speaker separation and recognition using non-negative matrix deconvolution with adaptive dictionary
Authors:: Drgas, Szymon
Virtanen, Tuomas
Abstract:: Highlights: The parametric dictionary allows adapting the whole dictionary by tuning a relatively small number of parameters. In a scenario where the unknown speakers' recordings are in training dataset together with recordings of many other speakers the proposed method outperforms NMD with dictionary which contains atoms of all speakers in the dataset. In the single-channel speech separation scenario where recordings of the recognized speakers were not in the training dataset, the proposed method brought clearly positive singnal-to-noise distortion ratios. Abstract: In this article, we propose a new method for joint cochannel speaker separation and recognition called adaptive-dictionary non-negative matrix deconvolution (DANMD). This method is an extension of non-negative matrix deconvolution (NMD) which models spectrogram matrix as a linear combination of dictionary elements (atoms). We propose a dictionary which is a linear combination of speaker-independent component and components representing speaker variability. The dictionary is parametric and all atoms depend on a small number of parameters. The speaker-independent component and components representing speaker variability are learned from recordings of tens or hundreds of speakers. We show that the proposed method can be applied to the single-channel speech separation task where two speakers of unknown identity are to be separated. In a scenario where the unknown speakers' recordings are in training dataset together … (more)
Is Part Of:: Computer speech & language. Volume 70(2021)
Journal:: Computer speech & language
Issue:: Volume 70(2021)
Issue Display:: Volume 70, Issue 2021 (2021)
Year:: 2021
Volume:: 70
Issue:: 2021
Issue Sort Value:: 2021-0070-2021-0000
Page Start:
Page End:
Publication Date:: 2021-11
Subjects:: Speech separation -- Cochannel speaker identification -- Non-negative matrix deconvolution
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
Journal URLs:: http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.csl.2021.101223 ↗
Languages:: English
ISSNs:: 0885-2308
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 17252.xml