Estimating post-editing time using a gold-standard set of machine translation errors. (May 2019)

Record Type:: Journal Article
Title:: Estimating post-editing time using a gold-standard set of machine translation errors. (May 2019)
Main Title:: Estimating post-editing time using a gold-standard set of machine translation errors
Authors:: Tezcan, Arda
Hoste, Véronique
Macken, Lieve
Abstract:: Highlights: An informative two-step machine translation quality estimation approach is proposed. Gold-standard error annotations are used to define an upper boundary for such a system. Post-editing time can be estimated successfully using the gold standard data. The predictive power of MT error types on post-editing time is investigated. Post-editing time can be estimated adequately by using a small set of MT error types. Abstract: With the improved quality of Machine Translation (MT) systems in the last decades, post-editing (the correction of MT errors) has gained importance in Computer-Assisted Translation (CAT) workflows. Depending on the number and the severity of the errors in the MT output, the effort required to post-edit varies from sentence to sentence. The existing Quality Estimation (QE) systems provide quality scores that reflect the quality of an MT output at sentence level or word level. However, they fail to explain the relationship between different types of MT errors and the required post-editing effort to correct them. We suggest a more informative approach to QE in which different types of MT errors are detected in a first step, which are then used to estimate post-editing effort in a second step. In this paper we define the upper boundary of such a system. We use different machine learning methods to estimate Post-Editing Time (PET) by using a gold-standard set of MT errors as features. We show that post-editing time can be estimated with high accuracy … (more)
Is Part Of:: Computer speech & language. Volume 55(2019)
Journal:: Computer speech & language
Issue:: Volume 55(2019)
Issue Display:: Volume 55, Issue 2019 (2019)
Year:: 2019
Volume:: 55
Issue:: 2019
Issue Sort Value:: 2019-0055-2019-0000
Page Start:: 120
Page End:: 144
Publication Date:: 2019-05
Subjects:: Machine translation -- Quality estimation -- Post-editing -- Machine learning -- Feature selection
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
Journal URLs:: http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.csl.2018.10.005 ↗
Languages:: English
ISSNs:: 0885-2308
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 9439.xml