A Language Identification System using Hybrid Features and Back-Propagation Neural Network. (July 2020)
- Record Type:
- Journal Article
- Title:
- A Language Identification System using Hybrid Features and Back-Propagation Neural Network. (July 2020)
- Main Title:
- A Language Identification System using Hybrid Features and Back-Propagation Neural Network
- Authors:
- Deshwal, Deepti
Sangwan, Pardeep
Kumar, Divya - Abstract:
- Abstract: Language Identification (LID) is accurate identification of the unknown language by comparison of speech biometrics of test speech sample and language models accumulated beforehand. This paper presents and encourages the use of hybrid robust feature extraction techniques for spoken language identification (LID) system. In the feature extraction stage, different techniques are applied individually such as Mel frequency cepstral coefficients (MFCCs), perceptual linear prediction features (PLP), relative perceptual linear prediction features (RASTA-PLP). Later, performance of our LID system based on several combinations of the different features (hybrid features) are investigated such as MFCC, PLP, combined with their 1st order derivatives, MFCC + RASTA-PLP, MFCC + SDC (Shifted delta cepstral coefficients). Language identification phase or classification utilizes feed forward back-propagation neural network (FFBPNN) and comparison is based on two learning algorithms: the Levenberg–Marquardt "trainlm" and the scaled conjugate gradient "trainscg". A comparative analysis in terms of performance is done between different hybrid feature extraction techniques and their individual counterparts. Results clearly indicates that improved performance is obtained with hybrid features with "trainlm" learning algorithm as compared to their individual counterparts. The results are very promising with MFCC-RASTA-PLP hybrid feature extraction technique in comparison to the other hybridAbstract: Language Identification (LID) is accurate identification of the unknown language by comparison of speech biometrics of test speech sample and language models accumulated beforehand. This paper presents and encourages the use of hybrid robust feature extraction techniques for spoken language identification (LID) system. In the feature extraction stage, different techniques are applied individually such as Mel frequency cepstral coefficients (MFCCs), perceptual linear prediction features (PLP), relative perceptual linear prediction features (RASTA-PLP). Later, performance of our LID system based on several combinations of the different features (hybrid features) are investigated such as MFCC, PLP, combined with their 1st order derivatives, MFCC + RASTA-PLP, MFCC + SDC (Shifted delta cepstral coefficients). Language identification phase or classification utilizes feed forward back-propagation neural network (FFBPNN) and comparison is based on two learning algorithms: the Levenberg–Marquardt "trainlm" and the scaled conjugate gradient "trainscg". A comparative analysis in terms of performance is done between different hybrid feature extraction techniques and their individual counterparts. Results clearly indicates that improved performance is obtained with hybrid features with "trainlm" learning algorithm as compared to their individual counterparts. The results are very promising with MFCC-RASTA-PLP hybrid feature extraction technique in comparison to the other hybrid feature extraction techniques with overall accuracy of 94.6% and a minimum test error rate of 0.10. The efficiency of proposed hybrid approaches is determined by simulating several experiments on a user defined language database of speech signals in the working platform of MATLAB. … (more)
- Is Part Of:
- Applied acoustics. Volume 164(2020)
- Journal:
- Applied acoustics
- Issue:
- Volume 164(2020)
- Issue Display:
- Volume 164, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 164
- Issue:
- 2020
- Issue Sort Value:
- 2020-0164-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-07
- Subjects:
- Language identification -- Hybrid feature extraction techniques -- Mel-frequency cepstral coefficient -- Feed-forward back propagation neural network
Acoustical engineering -- Periodicals
Periodicals
620.2 - Journal URLs:
- http://www.sciencedirect.com/science/journal/0003682X ↗
http://www.elsevier.com/journals ↗
http://www.elsevier.com/homepage/elecserv.htt ↗ - DOI:
- 10.1016/j.apacoust.2020.107289 ↗
- Languages:
- English
- ISSNs:
- 0003-682X
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 1571.400000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 13457.xml