HARVARD Citation

    Kim, J. et al. (2021). Visual question answering based on local-scene-aware referring expression generation. Neural Networks. pp. 158-167. [Online].
  