Cite
HARVARD Citation
Su, P. et al. (2018). Reward estimation for dialogue policy optimisation. Computer speech & language. pp. 24-43. [Online].
This is an interim version of our Electronic Legal Deposit Catalogue-eJournals and eBooks while we continue to recover from a cyber-attack.
Su, P. et al. (2018). Reward estimation for dialogue policy optimisation. Computer speech & language. pp. 24-43. [Online].