Emotion transplantation through adaptation in HMM-based speech synthesis. (November 2015)

Record Type:: Journal Article
Title:: Emotion transplantation through adaptation in HMM-based speech synthesis. (November 2015)
Main Title:: Emotion transplantation through adaptation in HMM-based speech synthesis
Authors:: Lorenzo-Trueba, Jaime
Barra-Chicote, Roberto
San-Segundo, Rubén
Ferreiros, Javier
Yamagishi, Junichi
Montero, Juan M.
Abstract:: Abstract : Highlights: We propose an emotion transplantation method based on adaptation techniques. Emotions can be imbued into neutral synthetic speech models regardless of gender. Five perceptual evaluations, including one with a robot, were carried out. Emotion transplantation clearly improves emotional performance over neutral voices. High quality source models provide high quality transplanted models. Abstract: This paper proposes an emotion transplantation method capable of modifying a synthetic speech model through the use of CSMAPLR adaptation in order to incorporate emotional information learned from a different speaker model while maintaining the identity of the original speaker as much as possible. The proposed method relies on learning both emotional and speaker identity information by means of their adaptation function from an average voice model, and combining them into a single cascade transform capable of imbuing the desired emotion into the target speaker. This method is then applied to the task of transplanting four emotions (anger, happiness, sadness and surprise) into 3 male speakers and 3 female speakers and evaluated in a number of perceptual tests. The results of the evaluations show how the perceived naturalness for emotional text significantly favors the use of the proposed transplanted emotional speech synthesis when compared to traditional neutral speech synthesis, evidenced by a big increase in the perceived emotional strength of the synthesized … (more)
Is Part Of:: Computer speech & language. Volume 34(2015)
Journal:: Computer speech & language
Issue:: Volume 34(2015)
Issue Display:: Volume 34, Issue 2015 (2015)
Year:: 2015
Volume:: 34
Issue:: 2015
Issue Sort Value:: 2015-0034-2015-0000
Page Start:: 292
Page End:: 307
Publication Date:: 2015-11
Subjects:: Statistical parametric speech synthesis -- Expressive speech synthesis -- Cascade adaptation -- Emotion transplantation
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
Journal URLs:: http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.csl.2015.03.008 ↗
Languages:: English
ISSNs:: 0885-2308
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 6446.xml