A new speech corpus of super-elderly Japanese for acoustic modeling. (January 2023)

Record Type:: Journal Article
Title:: A new speech corpus of super-elderly Japanese for acoustic modeling. (January 2023)
Main Title:: A new speech corpus of super-elderly Japanese for acoustic modeling
Authors:: Fukuda, Meiko
Nishimura, Ryota
Nishizaki, Hiromitsu
Horii, Koharu
Iribe, Yurie
Yamamoto, Kazumasa
Kitaoka, Norihide
Abstract:: Abstract: The development of accessible speech recognition technology will allow the elderly to more easily access electronically stored information. However, the necessary level of recognition accuracy for elderly speech has not yet been achieved using conventional speech recognition systems, due to the unique features of the speech of elderly people. To address this problem, we have created a new speech corpus named EARS (Elderly Adults Read Speech), consisting of the recorded read speech of 123 super-elderly Japanese people (average age: 83.1), as a resource for training automated speech recognition models for the elderly. In this study, we investigated the acoustic features of super-elderly Japanese speech using our new speech corpus. In comparison to the speech of less elderly Japanese speakers, we observed a slower speech rate and extended vowel duration for both genders, a slight increase in fundamental frequency for males, and a slight decrease in fundamental frequency for females. To demonstrate the efficacy of our corpus, we also conducted speech recognition experiments using two different acoustic models (DNN-HMM and transformer-based), trained with a combination of data from our corpus and speech data from three conventional Japanese speech corpora. When using the DNN-HMM trained with EARS and speech data from existing corpora, the character error rate (CER) was reduced by 7.8% (to just over 9%), compared to a CER of 16.9% when using only the baseline training … (more)
Is Part Of:: Computer speech & language. Volume 77(2023)
Journal:: Computer speech & language
Issue:: Volume 77(2023)
Issue Display:: Volume 77, Issue 2023 (2023)
Year:: 2023
Volume:: 77
Issue:: 2023
Issue Sort Value:: 2023-0077-2023-0000
Page Start:
Page End:
Publication Date:: 2023-01
Subjects:: 00-01 -- 99-00
Speech corpus -- Elderly -- Speech recognition -- Acoustic feature
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
Journal URLs:: http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.csl.2022.101424 ↗
Languages:: English
ISSNs:: 0885-2308
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 23382.xml