Automatic sub-word unit discovery and pronunciation lexicon induction for ASR with application to under-resourced languages. (September 2019)
- Record Type:
- Journal Article
- Title:
- Automatic sub-word unit discovery and pronunciation lexicon induction for ASR with application to under-resourced languages. (September 2019)
- Main Title:
- Automatic sub-word unit discovery and pronunciation lexicon induction for ASR with application to under-resourced languages
- Authors:
- Agenbag, Wiehan
Niesler, Thomas - Abstract:
- Abstract: We present a method enabling the unsupervised discovery of sub-word units (SWUs) and associated pronunciation lexicons for use in automatic speech recognition (ASR) systems. This includes a novel SWU discovery approach based on self-organising HMM-GMM states that are agglomeratively tied across words as well as a novel pronunciation lexicon induction approach that iteratively reduces pronunciation variation by means of model pruning. Our approach relies only on recorded speech and associated orthographic transcriptions and does not require alphabetic graphemes. We apply our methods to corpora of recorded radio broadcasts in Ugandan English, Luganda and Acholi, of which the latter two are under-resourced. The speech is conversational and contains high levels of background noise, and therefore presents a challenge to automatic lexicon induction. We demonstrate that our proposed method is able to discover lexicons that perform as well as baseline expert systems for Acholi, and close to this level for the other two languages when used to train DNN-HMM ASR systems. This demonstrates the potential of the method to enable and accelerate ASR for under-resourced languages for which a phone inventory and pronunciation lexicon are not available by eliminating the dependence on human expertise this usually requires.
- Is Part Of:
- Computer speech & language. Volume 57(2019)
- Journal:
- Computer speech & language
- Issue:
- Volume 57(2019)
- Issue Display:
- Volume 57, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 57
- Issue:
- 2019
- Issue Sort Value:
- 2019-0057-2019-0000
- Page Start:
- 20
- Page End:
- 40
- Publication Date:
- 2019-09
- Subjects:
- Unsupervised SWU discovery -- Automatic lexicon induction -- ASR -- Under-resourced languages
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2019.02.002 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 10443.xml