Multilingual and unsupervised subword modeling for zero-resource languages. (January 2021)

Record Type:: Journal Article
Title:: Multilingual and unsupervised subword modeling for zero-resource languages. (January 2021)
Main Title:: Multilingual and unsupervised subword modeling for zero-resource languages
Authors:: Hermann, Enno
Kamper, Herman
Goldwater, Sharon
Abstract:: Highlights: VTLN is a useful preprocessing step for unsupervised speech processing systems. Cross-lingual pre-training improves over target-language-only unsupervised training. Multilingual bottleneck features (BNFs) can be directly applied to unseen languages. Training BNFs on more languages improves cross-lingual word discrimination. An equivalent amount of data in a single language does not help as much. Abstract: Subword modeling for zero-resource languages aims to learn low-level representations of speech audio without using transcriptions or other resources from the target language (such as text corpora or pronunciation dictionaries). A good representation should capture phonetic content and abstract away from other types of variability, such as speaker differences and channel noise. Previous work in this area has primarily focused unsupervised learning from target language data only, and has been evaluated only intrinsically. Here we directly compare multiple methods, including some that use only target language speech data and some that use transcribed speech from other (non-target) languages, and we evaluate using two intrinsic measures as well as on a downstream unsupervised word segmentation and clustering task. We find that combining two existing target-language-only methods yields better features than either method alone. Nevertheless, even better results are obtained by extracting target language bottleneck features using a model trained on other languages. … (more)
Is Part Of:: Computer speech & language. Volume 65(2021)
Journal:: Computer speech & language
Issue:: Volume 65(2021)
Issue Display:: Volume 65, Issue 2021 (2021)
Year:: 2021
Volume:: 65
Issue:: 2021
Issue Sort Value:: 2021-0065-2021-0000
Page Start:
Page End:
Publication Date:: 2021-01
Subjects:: Multilingual bottleneck features -- Subword modeling -- Unsupervised feature extraction -- Zero-resource speech technology
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
Journal URLs:: http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.csl.2020.101098 ↗
Languages:: English
ISSNs:: 0885-2308
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 16859.xml