Quantifying the impact of dirty OCR on historical text analysis: Eighteenth Century Collections Online as a case study. (22nd April 2019)