Information density and overlap in spoken dialogue. (May 2016)
- Record Type:
- Journal Article
- Title:
- Information density and overlap in spoken dialogue. (May 2016)
- Main Title:
- Information density and overlap in spoken dialogue
- Authors:
- Dethlefs, Nina
Hastie, Helen
Cuayáhuitl, Heriberto
Yu, Yanchao
Rieser, Verena
Lemon, Oliver - Abstract:
- Abstract : Highlights: Information density, related to entropy, is related to overlaps in spoken language. Humans prefer overlaps based on information density and suprasegmental features. This is confirmed in a speech-based rating study ( p < 0.0001). Our results are relevant for spoken dialogue systems, especially incremental ones. Abstract: Incremental dialogue systems are often perceived as more responsive and natural because they are able to address phenomena of turn-taking and overlapping speech, such as backchannels or barge-ins. Previous work in this area has often identified distinctive prosodic features, or features relating to syntactic or semantic completeness, as marking appropriate places of turn-taking. In a separate strand of work, psycholinguistic studies have established a connection between information density and prominence in language—the less expected a linguistic unit is in a particular context, the more likely it is to be linguistically marked. This has been observed across linguistic levels, including the prosodic, which plays an important role in predicting overlapping speech. In this article, we explore the hypothesis that information density (ID) also plays a role in turn-taking. Specifically, we aim to show that humans are sensitive to the peaks and troughs of information density in speech, and that overlapping speech at ID troughs is perceived as more acceptable than overlaps at ID peaks. To test our hypothesis, we collect human ratings forAbstract : Highlights: Information density, related to entropy, is related to overlaps in spoken language. Humans prefer overlaps based on information density and suprasegmental features. This is confirmed in a speech-based rating study ( p < 0.0001). Our results are relevant for spoken dialogue systems, especially incremental ones. Abstract: Incremental dialogue systems are often perceived as more responsive and natural because they are able to address phenomena of turn-taking and overlapping speech, such as backchannels or barge-ins. Previous work in this area has often identified distinctive prosodic features, or features relating to syntactic or semantic completeness, as marking appropriate places of turn-taking. In a separate strand of work, psycholinguistic studies have established a connection between information density and prominence in language—the less expected a linguistic unit is in a particular context, the more likely it is to be linguistically marked. This has been observed across linguistic levels, including the prosodic, which plays an important role in predicting overlapping speech. In this article, we explore the hypothesis that information density (ID) also plays a role in turn-taking. Specifically, we aim to show that humans are sensitive to the peaks and troughs of information density in speech, and that overlapping speech at ID troughs is perceived as more acceptable than overlaps at ID peaks. To test our hypothesis, we collect human ratings for three models of generating overlapping speech based on features of: (1) prosody and semantic or syntactic completeness, (2) information density, and (3) both types of information. Results show that over 50% of users preferred the version using both types of features, followed by a preference for information density features alone. This indicates a clear human sensitivity to the effects of information density in spoken language and provides a strong motivation to adopt this metric for the design, development and evaluation of turn-taking modules in spoken and incremental dialogue systems. … (more)
- Is Part Of:
- Computer speech & language. Volume 37(2016)
- Journal:
- Computer speech & language
- Issue:
- Volume 37(2016)
- Issue Display:
- Volume 37, Issue 2016 (2016)
- Year:
- 2016
- Volume:
- 37
- Issue:
- 2016
- Issue Sort Value:
- 2016-0037-2016-0000
- Page Start:
- 82
- Page End:
- 97
- Publication Date:
- 2016-05
- Subjects:
- Overlap -- Turn-taking -- Information density -- Incremental processing -- Spoken dialogue systems
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2015.11.001 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 1216.xml