Recurrent out-of-vocabulary word detection based on distribution of features. (November 2019)
- Record Type:
- Journal Article
- Title:
- Recurrent out-of-vocabulary word detection based on distribution of features. (November 2019)
- Main Title:
- Recurrent out-of-vocabulary word detection based on distribution of features
- Authors:
- Asami, Taichi
Masumura, Ryo
Aono, Yushi
Shinoda, Koichi
- Abstract:
- Highlights: A novel method for robustly detecting out-of-vocabulary (OOV) words is proposed. The method focuses on the consistency of recurrent OOV words. The degree of consistency is measured by the distribution of features. The proposed method achieves over 60% relative reduction in false alarms.
Abstract: The repeated use of out-of-vocabulary (OOV) words in a spoken document seriously degrades a speech recognizer's performance. Even though such recurrent OOV words are often important keywords in a spoken document, they are never correctly recognized. We propose a novel method for robustly detecting recurrent OOV words, which focuses on the degree of consistency among them. It first detects recurrent segments, that is, recurrent phoneme sub-sequences in the output of a phoneme sequence decoder. Then, we measure the degree of consistency by using the mean and variance (distribution) of features (DOF) derived from the recurrent segments, and use our DOF for in-vocabulary (IV)/OOV classification. Experiments on academic lectures illustrate that the proposed DOF-based method can robustly detect recurrent OOV words in spontaneous speech and achieves over 60% relative reduction in false alarms. It is also confirmed that detection performance improves as the OOV words are repeated more often.
- Is Part Of:
- Computer speech & language. Volume 58(2019)
- Journal:
- Computer speech & language
- Issue:
- Volume 58(2019)
- Issue Display:
- Volume 58, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 58
- Issue:
- 2019
- Issue Sort Value:
- 2019-0058-2019-0000
- Page Start:
- 247
- Page End:
- 259
- Publication Date:
- 2019-11
- Subjects:
- Speech recognition -- Out-of-vocabulary (OOV) word detection -- Recurrent OOV words -- Distribution of features
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
- Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.csl.2019.04.007
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 11148.xml
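
The abstract in this record describes two steps: finding recurrent phoneme sub-sequences in a decoder's output, then summarising features of those segments by their mean and variance (DOF). The following is only a minimal sketch of that idea, not the authors' implementation. It assumes exact n-gram matching stands in for recurrent segment detection and that each occurrence already has a numeric feature vector; the function names `recurrent_segments` and `distribution_of_features`, the n-gram length, and the sample values are all hypothetical.

```python
from collections import defaultdict
from statistics import fmean, pvariance


def recurrent_segments(phonemes, n=3, min_count=2):
    """Map each phoneme n-gram seen at least min_count times to its start positions.

    Assumption: exact n-gram matching is a simplification chosen for this sketch;
    the record does not say how the paper matches recurrent segments.
    """
    positions = defaultdict(list)
    for i in range(len(phonemes) - n + 1):
        positions[tuple(phonemes[i:i + n])].append(i)
    return {seg: pos for seg, pos in positions.items() if len(pos) >= min_count}


def distribution_of_features(feature_vectors):
    """Return per-dimension means followed by per-dimension population variances.

    feature_vectors holds one equal-length list of floats per occurrence of a
    segment. Which features to use is not stated in the record (hypothetical input).
    """
    dims = list(zip(*feature_vectors))  # regroup values by feature dimension
    means = [fmean(d) for d in dims]
    variances = [pvariance(d) for d in dims]
    return means + variances


# Hypothetical decoder output in which the sequence "k a t o" occurs twice.
phonemes = "k a t o k a t o s a".split()
segments = recurrent_segments(phonemes, n=4)

# Hypothetical two-dimensional feature vectors, one per occurrence.
dof = distribution_of_features([[1.0, 2.0], [3.0, 2.0]])
```

In this sketch the resulting `dof` vector would be the input to an IV/OOV classifier. A low variance across occurrences corresponds to the consistency the abstract refers to, although how the paper turns that into a decision is not given in this record.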