Prosodic event detection in children's read speech. (July 2021)
- Record Type:
- Journal Article
- Title:
- Prosodic event detection in children's read speech. (July 2021)
- Main Title:
- Prosodic event detection in children's read speech
- Authors:
- Sabu, Kamini
Rao, Preeti - Abstract:
- Highlights: Data set of oral reading across L2 learner skill levels annotated for boundary and prominence. Acoustic features evaluated in speaker-independent and speaker-dependent testing. Trained model feature importances provide new insights related to speaking style. Abstract: Prosody is the supra-segmental aspect of speech that helps to convey the structure and intended meaning of lexical content unambiguously. The automatic detection of prosodic events, such as phrase boundary and word prominence, has a number of applications in discourse analysis, where a combination of syntactic and acoustic-prosodic features is typically employed. This work addresses prosodic event detection in the context of assessing oral reading skills of middle-school children. We discuss the observed characteristics of a specially created labeled data set of oral reading recordings of English stories by non-native speakers. The obtained diversity of language skills adds to the known challenges of high speaker variability in the acoustic realization of prosodic events. A combination of knowledge- and data-driven feature selection is implemented to identify a compact set of word-level features from the acoustic correlates of prosody considering different ways of incorporating the necessary temporal context. The system is benchmarked with reference to a widely known prosodic event recognition system in a speaker-independent set-up to obtain a competitive performance with greatly reduced featureHighlights: Data set of oral reading across L2 learner skill levels annotated for boundary and prominence. Acoustic features evaluated in speaker-independent and speaker-dependent testing. Trained model feature importances provide new insights related to speaking style. Abstract: Prosody is the supra-segmental aspect of speech that helps to convey the structure and intended meaning of lexical content unambiguously. The automatic detection of prosodic events, such as phrase boundary and word prominence, has a number of applications in discourse analysis, where a combination of syntactic and acoustic-prosodic features is typically employed. This work addresses prosodic event detection in the context of assessing oral reading skills of middle-school children. We discuss the observed characteristics of a specially created labeled data set of oral reading recordings of English stories by non-native speakers. The obtained diversity of language skills adds to the known challenges of high speaker variability in the acoustic realization of prosodic events. A combination of knowledge- and data-driven feature selection is implemented to identify a compact set of word-level features from the acoustic correlates of prosody considering different ways of incorporating the necessary temporal context. The system is benchmarked with reference to a widely known prosodic event recognition system in a speaker-independent set-up to obtain a competitive performance with greatly reduced feature dimensionality. The interpretable features enable us to use the predictor model importance scores to identify high-level speaker traits that influence the acoustic realization of prosodic events, suggesting a potential extension to systems that can extract and utilize speaker idiosyncrasies for superior prosodic event detection. … (more)
- Is Part Of:
- Computer speech & language. Volume 68(2021)
- Journal:
- Computer speech & language
- Issue:
- Volume 68(2021)
- Issue Display:
- Volume 68, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 68
- Issue:
- 2021
- Issue Sort Value:
- 2021-0068-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-07
- Subjects:
- Prosodic event -- Phrasing -- Prominence -- L2 prosody -- Non-native children's speech -- Literacy assessment
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2021.101200 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 16008.xml