Robust continuous digit recognition using Reservoir Computing. (March 2015)
- Record Type:
- Journal Article
- Title:
- Robust continuous digit recognition using Reservoir Computing. (March 2015)
- Main Title:
- Robust continuous digit recognition using Reservoir Computing
- Authors:
- Jalalvand, Azarakhsh
Triefenbach, Fabian
Demuynck, Kris
Martens, Jean-Pierre - Abstract:
- Abstract : Highlights: Study of robustness of Reservoir Computing (RC) based continuous digit recognizers. Discovery of new relations between RC control parameters, input and output dynamics. Use of these relations to find heuristics to reduce the reservoir development time. Creation of an RC-based recognizer that is more noise robust than the AFE-GMM-HMM. Abstract: It is acknowledged that Hidden Markov Models (HMMs) with Gaussian Mixture Models (GMMs) as the observation density functions achieve excellent digit recognition performance at high signal to noise ratios (SNRs). Moreover, many years of research have led to good techniques to reduce the impact of noise, distortion and mismatch between training and test conditions on the recognition accuracy. Nevertheless, we still await systems that are truly robust against these confounding factors. The present paper extends recent work on acoustic modeling based on Reservoir Computing (RC), a concept that has its roots in Machine Learning. By introducing a novel analysis of reservoirs as non-linear dynamical systems, new insights are gained and translated into a new reservoir design recipe that is extremely simple and highly comprehensible in terms of the dynamics of the acoustic features and the modeled acoustic units. By tuning the reservoir to these dynamics, one can create RC-based systems that not only compete well with conventional systems in clean conditions, but also degrade more gracefully in noisy conditions. ControlAbstract : Highlights: Study of robustness of Reservoir Computing (RC) based continuous digit recognizers. Discovery of new relations between RC control parameters, input and output dynamics. Use of these relations to find heuristics to reduce the reservoir development time. Creation of an RC-based recognizer that is more noise robust than the AFE-GMM-HMM. Abstract: It is acknowledged that Hidden Markov Models (HMMs) with Gaussian Mixture Models (GMMs) as the observation density functions achieve excellent digit recognition performance at high signal to noise ratios (SNRs). Moreover, many years of research have led to good techniques to reduce the impact of noise, distortion and mismatch between training and test conditions on the recognition accuracy. Nevertheless, we still await systems that are truly robust against these confounding factors. The present paper extends recent work on acoustic modeling based on Reservoir Computing (RC), a concept that has its roots in Machine Learning. By introducing a novel analysis of reservoirs as non-linear dynamical systems, new insights are gained and translated into a new reservoir design recipe that is extremely simple and highly comprehensible in terms of the dynamics of the acoustic features and the modeled acoustic units. By tuning the reservoir to these dynamics, one can create RC-based systems that not only compete well with conventional systems in clean conditions, but also degrade more gracefully in noisy conditions. Control experiments show that noise-robustness follows from the random fixation of the reservoir neurons whereas, tuning the reservoir dynamics increases the accuracy without compromising the noise-robustness. … (more)
- Is Part Of:
- Computer speech & language. Volume 30(2015)
- Journal:
- Computer speech & language
- Issue:
- Volume 30(2015)
- Issue Display:
- Volume 30, Issue 2015 (2015)
- Year:
- 2015
- Volume:
- 30
- Issue:
- 2015
- Issue Sort Value:
- 2015-0030-2015-0000
- Page Start:
- 135
- Page End:
- 158
- Publication Date:
- 2015-03
- Subjects:
- Reservoir Computing -- Recurrent Neural Networks -- Acoustic modeling -- Automatic Speech Recognition -- Noise robust spoken digit recognition
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2014.09.006 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 5423.xml