Recognition of Noisy Speech: A Comparative Survey of Robust Model Architecture and Feature Enhancement. (24th May 2009)
- Record Type:
- Journal Article
- Title:
- Recognition of Noisy Speech: A Comparative Survey of Robust Model Architecture and Feature Enhancement. (24th May 2009)
- Main Title:
- Recognition of Noisy Speech: A Comparative Survey of Robust Model Architecture and Feature Enhancement
- Authors:
- Schuller, Björn
Wöllmer, Martin
Moosmayr, Tobias
Rigoll, Gerhard - Other Names:
- Deng Li Academic Editor.
- Abstract:
- Abstract : Performance of speech recognition systems strongly degrades in the presence of background noise, like the driving noise inside a car. In contrast to existing works, we aim to improve noise robustness focusing on all major levels of speech recognition: feature extraction, feature enhancement, speech modelling, and training. Thereby, we give an overview of promising auditory modelling concepts, speech enhancement techniques, training strategies, and model architecture, which are implemented in an in-car digit and spelling recognition task considering noises produced by various car types and driving conditions. We prove that joint speech and noise modelling with a Switching Linear Dynamic Model (SLDM) outperforms speech enhancement techniques like Histogram Equalisation (HEQ) with a mean relative error reduction of 52.7% over various noise types and levels. Embedding a Switching Linear Dynamical System (SLDS) into a Switching Autoregressive Hidden Markov Model (SAR-HMM) prevails for speech disturbed by additive white Gaussian noise.
- Is Part Of:
- EURASIP journal on audio, speech, and music processing. Volume 2009(2009)
- Journal:
- EURASIP journal on audio, speech, and music processing
- Issue:
- Volume 2009(2009)
- Issue Display:
- Volume 2009, Issue 2009 (2009)
- Year:
- 2009
- Volume:
- 2009
- Issue:
- 2009
- Issue Sort Value:
- 2009-2009-2009-0000
- Page Start:
- Page End:
- Publication Date:
- 2009-05-24
- Subjects:
- Sound -- Recording and reproducing -- Digital techniques -- Periodicals
Computer sound processing -- Periodicals
Computer sound processing
Sound -- Recording and reproducing -- Digital techniques
Periodicals
Electronic journal
Electronic journals
620.2 - Journal URLs:
- https://asmp-eurasipjournals.springeropen.com/ ↗
http://www.hindawi.com/GetJournal.aspx?journal=ASMP ↗
http://www.hindawi.com/journals/asmp/contents.html ↗
http://link.springer.com/ ↗ - DOI:
- 10.1155/2009/942617 ↗
- Languages:
- English
- ISSNs:
- 1687-4714
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 10487.xml