Estimating acoustic speech features in low signal-to-noise ratios using a statistical framework. (March 2017)
- Record Type:
- Journal Article
- Title:
- Estimating acoustic speech features in low signal-to-noise ratios using a statistical framework. (March 2017)
- Main Title:
- Estimating acoustic speech features in low signal-to-noise ratios using a statistical framework
- Authors:
- Harding, Philip
Milner, Ben - Abstract:
- Highlights: We propose an integrated statistical framework to estimate acoustic speech features. Speaker and noise adaptation methods are developed and applied to noise-free models. Compared to other methods, the proposal is more accurate particularly at low SNRs. Abstract: Accurate estimation of acoustic speech features from noisy speech and from different speakers is an ongoing problem in speech processing. Many methods have been proposed to estimate acoustic features but errors increase as signal-to-noise ratios fall. This work proposes a robust statistical framework to estimate an acoustic speech vector (comprising voicing, fundamental frequency and spectral envelope) from an intermediate feature that is extracted from a noisy time-domain speech signal. The initial approach is accurate in clean conditions but deteriorates in noise and with changing speaker. Adaptation methods are then developed to adjust the acoustic models to the noise conditions and speaker. Evaluations are carried out in stationary and nonstationary noises and at SNRs from −5 dB to clean conditions. Comparison with conventional methods of estimating fundamental frequency, voicing and spectral envelope reveals the proposed framework to have lowest errors in all conditions tested.
- Is Part Of:
- Computer speech & language. Volume 42(2017)
- Journal:
- Computer speech & language
- Issue:
- Volume 42(2017)
- Issue Display:
- Volume 42, Issue 2017 (2017)
- Year:
- 2017
- Volume:
- 42
- Issue:
- 2017
- Issue Sort Value:
- 2017-0042-2017-0000
- Page Start:
- 1
- Page End:
- 19
- Publication Date:
- 2017-03
- Subjects:
- Voicing -- Fundamental frequency -- Spectral envelope -- Noise adaptation -- Speaker adaptation
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2016.08.001 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 704.xml