Building a large-scale testing dataset for conceptual semantic annotation of text. (2018)