Generative adversarial networks for speech processing: A review. (March 2022)
- Record Type:
- Journal Article
- Title:
- Generative adversarial networks for speech processing: A review. (March 2022)
- Main Title:
- Generative adversarial networks for speech processing: A review
- Authors:
- Wali, Aamir
Alamgir, Zareen
Karim, Saira
Fawaz, Ather
Ali, Mubariz Barkat
Adan, Muhammad
Mujtaba, Malik - Abstract:
- Abstract: Generative adversarial networks (GANs) have seen remarkable progress in recent years. They are used as generative models for all kinds of data such as text, images, audio, music, videos, and animations. This paper presents a comprehensive review of the novel and emerging GAN-based speech frameworks and algorithms that have revolutionized speech processing. We have categorized speech GANs based on application areas: speech synthesis, speech enhancement & conversion, and data augmentation in automatic speech recognition and emotion speech recognition systems. This review also includes a summary of the data sets and evaluation metrics commonly used in speech GANs. We also suggest some interesting research directions for future work and highlight the issues faced by current state-of-the-art speech GANs. Highlights: Presents a comprehensive review of speech GANs. Categorizes speech GANs based on application areas: speech synthesis, speech enhancement, and model augmentation. This work lays the ground for future research in speech GANs. Summarizes the evaluation metrics and resources required for conducting fruitful research.
- Is Part Of:
- Computer speech & language. Volume 72(2022)
- Journal:
- Computer speech & language
- Issue:
- Volume 72(2022)
- Issue Display:
- Volume 72, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 72
- Issue:
- 2022
- Issue Sort Value:
- 2022-0072-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-03
- Subjects:
- GANs -- Speech synthesis -- Speech enhancement -- Data augmentation -- Speech GANs
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2021.101308 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 20051.xml