A three-stage neural model for Arabic Dialect Identification. (May 2023)
- Record Type:
- Journal Article
- Title:
- A three-stage neural model for Arabic Dialect Identification. (May 2023)
- Main Title:
- A three-stage neural model for Arabic Dialect Identification
- Authors:
- Mohammed, Abdelmajeed
Jiangbin, Zheng
Murtadha, Ahmed - Abstract:
- Abstract: The Arabic language has several dialects across the twenty-two Arabic-speaking countries in Asia and Africa. Arabic Dialect Identification (ADI) is still a challenging task due to the well-recognized complexity and variations of Arabic dialects. It is noteworthy that Arabic dialects share the majority of tokens. The state-of-the-art solutions have been built upon various machine learning approaches. However, they commonly treat all words equally-likely and thus ignores the importance of dialectal words in response to a given dialect. In this paper, we propose a three-stage neural approach to learn the dialectal semantic representation from a given corpus. Specifically, we first aim to capture the dialect-relevant information, which is then used to model the dialectal vector representation. The goal is to filter away the shared words between dialects to reduce the noisy information fused to the fully connected layer. We introduce two variants, including LSTM-based and Transformer-based. Finally, we empirically evaluate the performance of the proposed solution by a comparative study on real benchmark datasets, including MADAR, NADI, and QADI. Our extensive experiments show that it consistently achieves state-of-the-art performance. Due to the well-recognized challenging of ADI, the improvement margins can be deemed considerable. The code is available on GitHub . 1
- Is Part Of:
- Computer speech & language. Volume 80(2023)
- Journal:
- Computer speech & language
- Issue:
- Volume 80(2023)
- Issue Display:
- Volume 80, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 80
- Issue:
- 2023
- Issue Sort Value:
- 2023-0080-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-05
- Subjects:
- Dialect identification -- Natural language processing -- Arabic dialect identification -- Deep neural networks
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2023.101488 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 26156.xml