Text-to-speech synthesis system with Arabic diacritic recognition system. (November 2015)

Record Type:: Journal Article
Title:: Text-to-speech synthesis system with Arabic diacritic recognition system. (November 2015)
Main Title:: Text-to-speech synthesis system with Arabic diacritic recognition system
Authors:: Rebai, Ilyes
BenAyed, Yassine
Abstract:: Abstract : Highlights: We developed an Arabic text-to-speech system, including a diacritization system. The speech synthesis system is based on statistical parametric. We address the accuracy of diacritic and acoustic models. We proposed a diacritization system based on the position of the current letter. Neural network per unit type based synthesis system generates high speech quality. Abstract: Text-to-speech synthesis system has been widely studied for many languages. However, speech synthesis for Arabic language has not sufficient progresses and it is still in its first stage. Statistical parametric synthesis based on hidden Markov models was the most commonly applied approach for Arabic language. Recently, synthesized speech quality based on deep neural networks was found as intelligible as human voice. This paper describes a Text-To-Speech (TTS) synthesis system for modern standard Arabic language based on statistical parametric approach and Mel-cepstral coefficients. Deep neural networks achieved state-of-the-art performance in a wide range of tasks, including speech synthesis. Our TTS system includes a diacritization system which is very important for Arabic TTS application. Our diacritization system is also based on deep neural networks. In addition to the use deep techniques, different methods were also proposed to model the acoustic parameters in order to address the problem of acoustic models accuracy. They are based on linguistic and acoustic characteristics … (more)
Is Part Of:: Computer speech & language. Volume 34(2015)
Journal:: Computer speech & language
Issue:: Volume 34(2015)
Issue Display:: Volume 34, Issue 2015 (2015)
Year:: 2015
Volume:: 34
Issue:: 2015
Issue Sort Value:: 2015-0034-2015-0000
Page Start:: 43
Page End:: 60
Publication Date:: 2015-11
Subjects:: Text-to-speech synthesis -- Statistical parametric -- Deep neural networks -- Natural language processing -- Diacritization system
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
Journal URLs:: http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.csl.2015.04.002 ↗
Languages:: English
ISSNs:: 0885-2308
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 6446.xml