Symbolic and Statistical Learning Approaches to Speech Summarization: A Scoping Review. (March 2022)
- Record Type:
- Journal Article
- Title:
- Symbolic and Statistical Learning Approaches to Speech Summarization: A Scoping Review. (March 2022)
- Main Title:
- Symbolic and Statistical Learning Approaches to Speech Summarization: A Scoping Review
- Authors:
- Rezazadegan, Dana
Berkovsky, Shlomo
Quiroz, Juan C.
Kocaballi, A. Baki
Wang, Ying
Laranjo, Liliana
Coiera, Enrico - Abstract:
- Highlights: Mapping the speech summarization literature, with no time frame, language or method restrictions A review on 110 papers reflecting utilized speech features, methods, scope, and training corpora Finding 4 speech summarization architectures based on similar patterns in their methodologies Supervised methods performed better than unsupervised methods, though attracted less interest Recent research into unsupervised methods focusses on extending language modelling ABSTRACT: Speech summarization techniques take human speech as input and then output an abridged version as text or speech. Speech summarization has applications in many domains from information technology to health care, for example improving speech archives or reducing clinical documentation burden. This scoping review maps close to 2 decades of speech summarization literature, spanning from the early machine learning works up to ensemble models, with no restrictions on the language summarized, research method, or paper type. We reviewed a total of 110 papers out of a set of 188 found through a literature search and extracted speech features used, methods, scope, and training corpora. Most studies employ one of four speech summarization architectures: (1) Sentence extraction and compaction; (2) Feature extraction and classification or rank-based sentence selection; (3) Sentence compression and compression summarization; and (4) Language modelling. We also discuss the strengths and weaknesses of theseHighlights: Mapping the speech summarization literature, with no time frame, language or method restrictions A review on 110 papers reflecting utilized speech features, methods, scope, and training corpora Finding 4 speech summarization architectures based on similar patterns in their methodologies Supervised methods performed better than unsupervised methods, though attracted less interest Recent research into unsupervised methods focusses on extending language modelling ABSTRACT: Speech summarization techniques take human speech as input and then output an abridged version as text or speech. Speech summarization has applications in many domains from information technology to health care, for example improving speech archives or reducing clinical documentation burden. This scoping review maps close to 2 decades of speech summarization literature, spanning from the early machine learning works up to ensemble models, with no restrictions on the language summarized, research method, or paper type. We reviewed a total of 110 papers out of a set of 188 found through a literature search and extracted speech features used, methods, scope, and training corpora. Most studies employ one of four speech summarization architectures: (1) Sentence extraction and compaction; (2) Feature extraction and classification or rank-based sentence selection; (3) Sentence compression and compression summarization; and (4) Language modelling. We also discuss the strengths and weaknesses of these different methods and speech features. Overall, supervised methods (e.g. Hidden Markov support vector machines, Ranking support vector machines, Conditional random fields) performed better than unsupervised methods. As supervised methods require manually annotated training data which can be costly, there was more interest in unsupervised methods. Recent research into unsupervised methods focusses on extending language modelling, for example by combining Uni-gram modelling with deep neural networks. This review does not include recent work in deep learning. … (more)
- Is Part Of:
- Computer speech & language. Volume 72(2022)
- Journal:
- Computer speech & language
- Issue:
- Volume 72(2022)
- Issue Display:
- Volume 72, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 72
- Issue:
- 2022
- Issue Sort Value:
- 2022-0072-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-03
- Subjects:
- Speech summarization -- Spontaneous speech -- Automatic speech recognition -- Extractive summarization -- Abstractive summarization
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2021.101305 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 20100.xml