Code-switched automatic speech recognition in five South African languages. (January 2022)

Record Type:: Journal Article
Title:: Code-switched automatic speech recognition in five South African languages. (January 2022)
Main Title:: Code-switched automatic speech recognition in five South African languages
Authors:: Biswas, Astik
Yılmaz, Emre
van der Westhuizen, Ewald
de Wet, Febe
Niesler, Thomas
Abstract:: Abstract: Most automatic speech recognition (ASR) systems are optimised for one specific language and their performance consequently deteriorates drastically when confronted with multilingual or code-switched speech. We describe our efforts to improve an ASR system that can process code-switched South African speech that contains English and four indigenous languages: isiZulu, isiXhosa, Sesotho and Setswana. We begin using a newly developed language-balanced corpus of code-switched speech compiled from South African soap operas, which are rich in spontaneous code-switching. The small size of the corpus makes this scenario under-resourced, and hence we explore several ways of addressing this sparsity of data. We consider augmenting the acoustic training sets with in-domain data at the expense of making it unbalanced and dominated by English. We further explore the inclusion of monolingual out-of-domain data in the constituent languages. For language modelling, we investigate the inclusion of out-of-domain text data sources and also the inclusion of synthetically-generated code-switch bigrams. In our experiments, we consider two system architectures. The first considers four bilingual speech recognisers, each allowing code-switching between English and one of the indigenous languages. The second considers a single pentalingual speech recogniser able to process switching between all five languages. We find that the additional inclusion of each acoustic and text data source … (more)
Is Part Of:: Computer speech & language. Volume 71(2022)
Journal:: Computer speech & language
Issue:: Volume 71(2022)
Issue Display:: Volume 71, Issue 2022 (2022)
Year:: 2022
Volume:: 71
Issue:: 2022
Issue Sort Value:: 2022-0071-2022-0000
Page Start:
Page End:
Publication Date:: 2022-01
Subjects:: Code-switching -- Under-resourced languages -- African languages -- Bantu languages -- Speech recognition -- TDNN–BLSTM -- TDNN-F
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
Journal URLs:: http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.csl.2021.101262 ↗
Languages:: English
ISSNs:: 0885-2308
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 19299.xml