Situated reference resolution using visual saliency and crowdsourcing-based priors for a spoken dialog system within vehicles. (March 2018)
- Record Type:
- Journal Article
- Title:
- Situated reference resolution using visual saliency and crowdsourcing-based priors for a spoken dialog system within vehicles. (March 2018)
- Main Title:
- Situated reference resolution using visual saliency and crowdsourcing-based priors for a spoken dialog system within vehicles
- Authors:
- Misu, Teruhisa
- Abstract:
- Abstract: In this paper, we address issues in situated language understanding in a moving car. More specifically, we propose a reference resolution method to identify user queries about specific target objects in their surroundings. We investigate methods of predicting which target object is likely to be queried given a visual scene and what kind of linguistic cues users naturally provide to describe a given target object in a situated environment. We propose methods to incorporate the visual saliency of the visual scene as a prior. Crowdsourced statistics of how people describe an object are also used as a prior. We have collected situated utterances from drivers using our research system, which was embedded in a real vehicle. We demonstrate that the proposed algorithms improve target identification rate by 15.1% absolute over the baseline method that does not use visual saliency-based prior and depends on public database with a limited number of category information.
- Is Part Of:
- Computer speech & language. Volume 48(2018)
- Journal:
- Computer speech & language
- Issue:
- Volume 48(2018)
- Issue Display:
- Volume 48, Issue 2018 (2018)
- Year:
- 2018
- Volume:
- 48
- Issue:
- 2018
- Issue Sort Value:
- 2018-0048-2018-0000
- Page Start:
- 1
- Page End:
- 14
- Publication Date:
- 2018-03
- Subjects:
- Situated dialog -- In-car interaction -- Visual saliency -- Crowdsourcing -- Multimodal interaction
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2017.09.001 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 5454.xml