DBMiP: A pre-training method for information propagation over deep networks. (May 2019)
- Record Type:
- Journal Article
- Title:
- DBMiP: A pre-training method for information propagation over deep networks. (May 2019)
- Main Title:
- DBMiP: A pre-training method for information propagation over deep networks
- Authors:
- Zoughi, Toktam
Homayounpour, Mohammad Mehdi - Abstract:
- Abstract: Deep neural networks (DNNs) have recently been successful in many applications and have become a popular approach for speech recognition. Training a DNN model for speech recognition is computationally expensive due to the model large number of parameters. Pre-training improves DNN modeling. However, DNN learning is challenging if pre-training is inefficient. This paper introduces a new framework for pre-training that utilizes label information in lower layers (layers near input) for better recognition. The proposed pre-training method dynamically inserts discriminative information not only in the last layer but also in other layers. In this algorithm, the lower layers achieve more generative information while the higher layers achieve more discriminative information. In addition, this method uses speaker information by employing the Subspace Gaussian Mixture Model (SGMM), which improves recognition accuracy. Experimental results on TIMIT, MNIST, Switchboard, and English Broadcast News datasets show that this approach significantly outperforms current state-of-the-art methods such as the Deep Belief Network and the Deep Boltzmann Machine. Moreover, the proposed algorithm has minimal memory requirements.
- Is Part Of:
- Computer speech & language. Volume 55(2019)
- Journal:
- Computer speech & language
- Issue:
- Volume 55(2019)
- Issue Display:
- Volume 55, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 55
- Issue:
- 2019
- Issue Sort Value:
- 2019-0055-2019-0000
- Page Start:
- 82
- Page End:
- 100
- Publication Date:
- 2019-05
- Subjects:
- Speech recognition -- Deep neural networks -- Deep boltzmann machine -- Pre-training -- Subspace gaussian mixture model
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2018.10.001 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 9439.xml