An analysis of observation length requirements for machine understanding of human behaviors from spoken language. (March 2021)
- Record Type:
- Journal Article
- Title:
- An analysis of observation length requirements for machine understanding of human behaviors from spoken language. (March 2021)
- Main Title:
- An analysis of observation length requirements for machine understanding of human behaviors from spoken language
- Authors:
- Nallan Chakravarthula, Sandeep
Baucom, Brian R.W.
Narayanan, Shrikanth
Georgiou, Panayiotis - Abstract:
- Highlights: Accuracy of human behavior estimation from conversational language depends on length of window used by system to observe language cues Proposed framework analyzes (1) extrinsic similarity between system predictions and human judgments and (2) intrinsic consistency between system predictions of related behaviors to determine appropriate observation window lengths for different behaviors Behaviors related to negative and positive affect best captured from short, frequently occurring expressions Behaviors describing problem-solving traits such as negotiation, responsibility, etc. require longer observations for accurate estimation Behaviors related to dysphoria such as sadness, withdrawal, etc. exhibit impulsive characteristics, difficult to capture from language cues alone Abstract: The task of quantifying human behavior by observing interaction cues is an important and useful one across a range of domains in psychological research and practice. Machine learning-based approaches typically perform this task by first estimating behavior based on cues within an observation window, such as a fixed number of words, and then aggregating the behavior over all the windows in that interaction. The length of this window directly impacts the accuracy of estimation by controlling the amount of information being used. The exact link between window length and accuracy, however, has not been well studied, especially in spoken language. In this paper, we investigate this link andHighlights: Accuracy of human behavior estimation from conversational language depends on length of window used by system to observe language cues Proposed framework analyzes (1) extrinsic similarity between system predictions and human judgments and (2) intrinsic consistency between system predictions of related behaviors to determine appropriate observation window lengths for different behaviors Behaviors related to negative and positive affect best captured from short, frequently occurring expressions Behaviors describing problem-solving traits such as negotiation, responsibility, etc. require longer observations for accurate estimation Behaviors related to dysphoria such as sadness, withdrawal, etc. exhibit impulsive characteristics, difficult to capture from language cues alone Abstract: The task of quantifying human behavior by observing interaction cues is an important and useful one across a range of domains in psychological research and practice. Machine learning-based approaches typically perform this task by first estimating behavior based on cues within an observation window, such as a fixed number of words, and then aggregating the behavior over all the windows in that interaction. The length of this window directly impacts the accuracy of estimation by controlling the amount of information being used. The exact link between window length and accuracy, however, has not been well studied, especially in spoken language. In this paper, we investigate this link and present an analysis framework that determines appropriate window lengths for the task of behavior estimation. Our proposed framework utilizes a two-pronged evaluation approach: (a) extrinsic similarity between machine predictions and human expert annotations, and (b) intrinsic consistency between intra-machine and intra-human behavior relations. We apply our analysis to real-life conversations that are annotated for a large and diverse set of behavior codes and examine the relation between the nature of a behavior and how long it should be observed. We find that behaviors describing negative and positive affect can be accurately estimated from short to medium-length expressions whereas behaviors related to problem-solving and dysphoria require much longer observations and are difficult to quantify from language alone. These findings are found to be generally consistent across different behavior modeling approaches. … (more)
- Is Part Of:
- Computer speech & language. Volume 66(2021)
- Journal:
- Computer speech & language
- Issue:
- Volume 66(2021)
- Issue Display:
- Volume 66, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 66
- Issue:
- 2021
- Issue Sort Value:
- 2021-0066-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-03
- Subjects:
- Human behavior -- Spoken language -- Observation window length -- Machine learning
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2020.101162 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 15413.xml