Employing distance-based semantics to interpret spoken referring expressions. (November 2015)
- Record Type:
- Journal Article
- Title:
- Employing distance-based semantics to interpret spoken referring expressions. (November 2015)
- Main Title:
- Employing distance-based semantics to interpret spoken referring expressions
- Authors:
- Zukerman, Ingrid
Kim, Su Nam
Kleinbauer, Thomas
Moshtaghi, Masud - Abstract:
- Abstract : Highlights: An interpretation process that considers multiple alternatives. A mechanism for combining uncertainty from a variety of sources. Distance functions with probabilistic semantics that represent similarity measures. Two evaluation experiments to assess our system's performance. Abstract: In this paper, we present Scusi?, an anytime numerical mechanism for the interpretation of spoken referring expressions. Our contributions are: (1) an anytime interpretation process that considers multiple alternatives at different interpretation stages (speech, syntax, semantics and pragmatics), which enables Scusi? to defer decisions to the end of the interpretation process; (2) a mechanism that combines scores associated with the output of the different interpretation stages, taking into account the uncertainty arising from a variety of sources, such as ambiguity or inaccuracy in a description, speech recognition errors and out-of-vocabulary terms; and (3) distance-based functions with probabilistic semantics that represent lexical similarity between objects' names and similarity between stated requirements and physical properties of objects (viz colour, size and positional relations). We considered two approaches for combining these descriptive attributes, viz multiplicative and additive, and determined whether prioritizing certain interpretation stages and descriptive attributes affects interpretation performance. We conducted two experiments to evaluate differentAbstract : Highlights: An interpretation process that considers multiple alternatives. A mechanism for combining uncertainty from a variety of sources. Distance functions with probabilistic semantics that represent similarity measures. Two evaluation experiments to assess our system's performance. Abstract: In this paper, we present Scusi?, an anytime numerical mechanism for the interpretation of spoken referring expressions. Our contributions are: (1) an anytime interpretation process that considers multiple alternatives at different interpretation stages (speech, syntax, semantics and pragmatics), which enables Scusi? to defer decisions to the end of the interpretation process; (2) a mechanism that combines scores associated with the output of the different interpretation stages, taking into account the uncertainty arising from a variety of sources, such as ambiguity or inaccuracy in a description, speech recognition errors and out-of-vocabulary terms; and (3) distance-based functions with probabilistic semantics that represent lexical similarity between objects' names and similarity between stated requirements and physical properties of objects (viz colour, size and positional relations). We considered two approaches for combining these descriptive attributes, viz multiplicative and additive, and determined whether prioritizing certain interpretation stages and descriptive attributes affects interpretation performance. We conducted two experiments to evaluate different aspects of Scusi? 's performance: Interpretive, where we compared Scusi? 's understanding of descriptions that are mainly ambiguous or inaccurate with people's understanding of these descriptions, and Generative, where we assessed Scusi? 's understanding of naturally occurring spoken descriptions. Our results show that Scusi? 's understanding of the descriptions in the Interpretive trial is comparable to that of people; and that its performance is encouraging when given arbitrary spoken descriptions in diverse scenarios, and excellent for the corresponding written descriptions. In both experiments, Scusi? significantly outperformed a baseline system that maintains only top same-score interpretations. … (more)
- Is Part Of:
- Computer speech & language. Volume 34(2015)
- Journal:
- Computer speech & language
- Issue:
- Volume 34(2015)
- Issue Display:
- Volume 34, Issue 2015 (2015)
- Year:
- 2015
- Volume:
- 34
- Issue:
- 2015
- Issue Sort Value:
- 2015-0034-2015-0000
- Page Start:
- 154
- Page End:
- 185
- Publication Date:
- 2015-11
- Subjects:
- Spoken language understanding -- Numerical approach -- Semantic interpretation -- Distance-based semantics -- Performance evaluation
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2015.01.002 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 6446.xml