An experimental study measuring human annotator categorization agreement on commonsense sentences. (18th June 2021)