Spoken language identification using a genetic-based fusion approach to combine acoustic and universal phonetic results. (January 2023)
- Record Type:
- Journal Article
- Title:
- Spoken language identification using a genetic-based fusion approach to combine acoustic and universal phonetic results. (January 2023)
- Main Title:
- Spoken language identification using a genetic-based fusion approach to combine acoustic and universal phonetic results
- Authors:
- Moradi, Ashkan
Shekofteh, Yasser - Abstract:
- Abstract: Identification of the spoken languages in an audio file is performed automatically using the spoken language identification (LID) process. In this paper, we proposed a genetic-based fusion method to combine the score probabilities of an x-vector-based acoustic LID (ALID) and a phonetic LID (PLID) system. The ALID system is based on an LDA classifier able to identify different languages using x-vectors, while the PLID system is based on an SVM classifier which takes into account perplexities as its feature vector, which are derived from phone language models utilizing a universal phone recognizer named Allosaurus. With the help of genetic-based fusion, 54 weights will be extracted. Having 27 languages in our database and two different LID systems results in 54 weights for our fusion. The individual results of our acoustic and phonetic LID systems are eventually combined by applying these weights. Based on the experimental results on 27 languages from the NIST-LRE09 database, the fusion of the acoustic system and the phonetic system results in 93.30% accuracy, which has approximately a 21% reduction in identification error to our best baseline system with 91.50% accuracy. Graphical abstract: Highlights: A spoken language identification (LID) system is proposed based on Allosaurus. A new genetic-based method is used to combine acoustic and phonetic LID systems. Results of LID systems are investigated on the language pairs.
- Is Part Of:
- Computers & electrical engineering. Volume 105(2023)
- Journal:
- Computers & electrical engineering
- Issue:
- Volume 105(2023)
- Issue Display:
- Volume 105, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 105
- Issue:
- 2023
- Issue Sort Value:
- 2023-0105-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-01
- Subjects:
- Spoken language identification -- x-vectors -- Acoustic-based approach -- Phonetic-based approach -- Universal phone recognition -- Classifier fusion -- Genetic algorithm
Computer engineering -- Periodicals
Electrical engineering -- Periodicals
Electrical engineering -- Data processing -- Periodicals
Ordinateurs -- Conception et construction -- Périodiques
Électrotechnique -- Périodiques
Électrotechnique -- Informatique -- Périodiques
Computer engineering
Electrical engineering
Electrical engineering -- Data processing
Periodicals
Electronic journals
621.302854 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00457906/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compeleceng.2022.108549 ↗
- Languages:
- English
- ISSNs:
- 0045-7906
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.680000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 25029.xml