Image-Text Joint Learning for Social Images with Spatial Relation Model. (28th March 2020)