A novel approach to remove outliers for parallel voice conversion. (November 2019)

Record Type:: Journal Article
Title:: A novel approach to remove outliers for parallel voice conversion. (November 2019)
Main Title:: A novel approach to remove outliers for parallel voice conversion
Authors:: Shah, Nirmesh J.
Patil, Hemant A.
Abstract:: Alignment is a key step before learning a mapping function between a source and a target speaker's spectral features in various state-of-the-art parallel data Voice Conversion (VC) techniques. After alignment, some corresponding pairs are still inconsistent with the rest of the data and are considered outliers. These outliers shift the parameters of the mapping function from their true values and hence negatively affect the learning of the mapping function during the training phase of the VC task. To the best of the authors' knowledge, the effect of outliers (and hence, their removal) on the quality of the converted voice has not been explored much in the VC literature. Recent research has shown the effectiveness of outlier removal as a pre-processing step in VC. In this paper, we extend this study with a detailed theory and analysis. The proposed method uses a score distance that is estimated using Robust Principal Component Analysis (ROBPCA) to detect the outliers. In particular, the outliers are determined using a fixed cut-off on the score distances, based on the degrees of freedom in a chi-squared distribution, which is speaker-pair independent. The fixed cut-off is due to the assumption that the score distances follow the normal (i.e., Gaussian) distribution. However, this is a weak statistical assumption even in cases where a large number of samples is available. Hence, in this paper, we propose to explore speaker-pair dependent cut-offs to detect the …
Is Part Of:: Computer speech & language. Volume 58(2019)
Journal:: Computer speech & language
Issue:: Volume 58(2019)
Issue Display:: Volume 58, Issue 2019 (2019)
Year:: 2019
Volume:: 58
Issue:: 2019
Issue Sort Value:: 2019-0058-2019-0000
Page Start:: 127
Page End:: 152
Publication Date:: 2019-11
Subjects:: Outlier removal -- Outlier detection -- Parallel data voice conversion -- Robust Principal Component Analysis
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
Journal URLs:: http://www.journals.elsevier.com/computer-speech-and-language/
http://www.elsevier.com/journals
DOI:: 10.1016/j.csl.2019.03.009
Languages:: English
ISSNs:: 0885-2308
Deposit Type:: Legal deposit
View Content:: Available online (eLD content is only available in our Reading Rooms)
Physical Locations:: British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 13046.xml
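
Illustrative note on the abstract: the fixed cut-off it describes can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the authors' implementation. Classical PCA (via SVD) stands in for ROBPCA, so the estimate is not robust. The 0.975 quantile, the Wilson-Hilferty approximation to the chi-squared quantile, the function names, and the synthetic data are all assumptions introduced here for illustration. In a VC setting, each row of `X` would presumably be a feature vector built from one aligned source-target pair.

```python
import numpy as np
from statistics import NormalDist


def score_distances(X, k):
    """Score distance of each row of X inside a k-dimensional PCA subspace.

    Assumption: classical PCA is used as a stand-in; it is NOT ROBPCA.
    """
    Xc = X - X.mean(axis=0)
    # SVD of the centred data: rows of Vt are the principal directions.
    _, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    scores = Xc @ Vt[:k].T                      # projections onto first k PCs
    eigvals = (s[:k] ** 2) / (X.shape[0] - 1)   # variance along each PC
    return np.sqrt(np.sum(scores ** 2 / eigvals, axis=1))


def chi2_quantile(p, k):
    """Approximate chi-squared quantile with k degrees of freedom.

    Uses the Wilson-Hilferty approximation (an assumption made here to
    avoid extra dependencies); it is not exact.
    """
    z = NormalDist().inv_cdf(p)
    c = 2.0 / (9.0 * k)
    return k * (1.0 - c + z * np.sqrt(c)) ** 3


def flag_outliers(X, k=2, p=0.975):
    """Flag rows whose score distance exceeds a fixed chi-squared cut-off."""
    sd = score_distances(X, k)
    cutoff = np.sqrt(chi2_quantile(p, k))  # p = 0.975 is an assumed level
    return sd > cutoff, sd, cutoff


# Synthetic example: 200 well-behaved rows plus one gross outlier.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(size=(200, 4)), [30.0, 30.0, 0.0, 0.0]])
mask, sd, cutoff = flag_outliers(X, k=2)
```

The cut-off here depends only on `k` and `p`, which is what makes it independent of the speaker pair. The speaker-pair dependent cut-offs that the abstract goes on to propose are not shown, because the record truncates that part of the abstract.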