Adversarial training and decoding strategies for end-to-end neural conversation models. (March 2019)
- Record Type:
- Journal Article
- Title:
- Adversarial training and decoding strategies for end-to-end neural conversation models. (March 2019)
- Main Title:
- Adversarial training and decoding strategies for end-to-end neural conversation models
- Authors:
- Hori, Takaaki
Wang, Wen
Koji, Yusuke
Hori, Chiori
Harsham, Bret
Hershey, John R. - Abstract:
- Highlights: An advanced end to end conversation system for the 6-th edition of Dialog System Technology Challenge (DSTC6). Applying sequence adversarial training with extension of the objective function to improve both objective and subjective evaluation metrics. Minimum Bayes risk (MBR) based system combination of multiple neural conversation models. Example-based response selection using an embedding-based context similarity. Thorough evaluation of three different neural conversation models, training techniques, and decoding strategies using a help-desk dialog task in DSTC6. Abstract: This paper presents adversarial training and decoding methods for neural conversation models that can generate natural responses given dialog contexts. In our prior work, we built several end-to-end conversation systems for the 6th Dialog System Technology Challenges (DSTC6) Twitter help-desk dialog task. These systems included novel extensions of sequence adversarial training, example-based response extraction, and Minimum Bayes-Risk based system combination. In DSTC6, our systems achieved the best performance in most objective measures such as BLEU and METEOR scores and decent performance in a subjective measure based on human rating. In this paper, we provide a complete set of our experiments for DSTC6 and further extend the training and decoding strategies more focusing on improving the subjective measure, where we combine responses of three adversarial models. Experimental resultsHighlights: An advanced end to end conversation system for the 6-th edition of Dialog System Technology Challenge (DSTC6). Applying sequence adversarial training with extension of the objective function to improve both objective and subjective evaluation metrics. Minimum Bayes risk (MBR) based system combination of multiple neural conversation models. Example-based response selection using an embedding-based context similarity. Thorough evaluation of three different neural conversation models, training techniques, and decoding strategies using a help-desk dialog task in DSTC6. Abstract: This paper presents adversarial training and decoding methods for neural conversation models that can generate natural responses given dialog contexts. In our prior work, we built several end-to-end conversation systems for the 6th Dialog System Technology Challenges (DSTC6) Twitter help-desk dialog task. These systems included novel extensions of sequence adversarial training, example-based response extraction, and Minimum Bayes-Risk based system combination. In DSTC6, our systems achieved the best performance in most objective measures such as BLEU and METEOR scores and decent performance in a subjective measure based on human rating. In this paper, we provide a complete set of our experiments for DSTC6 and further extend the training and decoding strategies more focusing on improving the subjective measure, where we combine responses of three adversarial models. Experimental results demonstrate that the extended methods improve the human rating score and outperform the best score in DSTC6. … (more)
- Is Part Of:
- Computer speech & language. Volume 54(2019)
- Journal:
- Computer speech & language
- Issue:
- Volume 54(2019)
- Issue Display:
- Volume 54, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 54
- Issue:
- 2019
- Issue Sort Value:
- 2019-0054-2019-0000
- Page Start:
- 122
- Page End:
- 139
- Publication Date:
- 2019-03
- Subjects:
- Dialog system -- Conversation model -- Sequence-to-sequence model -- Sentence generation
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2018.08.006 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 8758.xml