Cite

HARVARD Citation

    Su, P. et al. (2018). Reward estimation for dialogue policy optimisation. Computer Speech & Language. pp. 24-43. [Online].
  