On the use of acoustic features for automatic disambiguation of homophones in spontaneous German. (November 2018)

Record Type:: Journal Article
Title:: On the use of acoustic features for automatic disambiguation of homophones in spontaneous German. (November 2018)
Main Title:: On the use of acoustic features for automatic disambiguation of homophones in spontaneous German
Authors:: Schuppler, Barbara
Schrank, Tobias
Abstract:: Highlights: Hypothesis: in spontaneous speech, homophones differ in their phonetic detail. 193 acoustic features and random forests are used for homophone disambiguation. Temporal features result to be most relevant to automatic homophone disambiguation. Model with acoustic features outperforms lexical model by 26% F 1 . Abstract: Homophones pose serious issues for automatic speech recognition (ASR) as they have the same pronunciation but different meanings or spellings. Homophone disambiguation is usually done within a stochastic language model or by an analysis of the homophonous word's context, similarly to word sense disambiguation. Whereas this method reaches good results in read speech, it fails in conversational, spontaneous speech, where utterances are often short, contain disfluencies and/or are realized syntactically incomplete. Phonetic studies, however, have shown that words that are homophonous in read speech often differ in their phonetic detail in spontaneous speech. Whereas humans use phonetic detail to disambiguate homophones, this linguistic information is usually not explicitly incorporated into ASR systems. In this paper, we show that phonetic detail can be used to automatically disambiguate homophones using the example of German pronouns. Using 3179 homophonous tokens from a corpus of spontaneous German and a set of acoustic features, we trained a random forest model. Our results show that homophones can be disambiguated reasonably well using acoustic … (more)
Is Part Of:: Computer speech & language. Volume 52(2018)
Journal:: Computer speech & language
Issue:: Volume 52(2018)
Issue Display:: Volume 52, Issue 2018 (2018)
Year:: 2018
Volume:: 52
Issue:: 2018
Issue Sort Value:: 2018-0052-2018-0000
Page Start:: 209
Page End:: 224
Publication Date:: 2018-11
Subjects:: Homophone disambiguation -- Automatic speech recognition -- Phonetic detail -- Spontaneous speech -- Random forests
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
Journal URLs:: http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.csl.2017.12.011 ↗
Languages:: English
ISSNs:: 0885-2308
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 17055.xml