Adjusting Bilingual Ratings by Retest Reliability Improves Estimation of Translation Quality. (October 2018)