The translator's visibility: Detecting translatorial fingerprints in contemporaneous parallel translations. (November 2018)
- Record Type:
- Journal Article
- Title:
- The translator's visibility: Detecting translatorial fingerprints in contemporaneous parallel translations. (November 2018)
- Main Title:
- The translator's visibility: Detecting translatorial fingerprints in contemporaneous parallel translations
- Authors:
- Lynch, Gerard
Vogel, Carl - Abstract:
- Highlights: We examine the stylistic fingerprint of two sets of literary translators. We use machine learning techniques such as SVMs, Nave Bayes and Simple Logistic Regression which are relatively novel approaches to the task. We find that common words and bigrams are strong discriminators of translators style based on two authors, who original wrote in Russian and Norwegian and were translated to English. We find that translators style remains discriminable across non-parallel translations of the same author. We also find that textual metrics such as common verb frequencies are strong indicators of translators style in both cases, in line with prior studies in the literature. Abstract: We detail the results of experiments towards a fine-grained stylometric analysis, the identification of distinguishing features between contemporaneous literary translations, both parallel works and also translations of non-parallel sets of works by the same author. We examine translations of plays by the Norwegian dramatist Henrik Ibsen with the initial point of focus being the Ibsen drama Ghosts, for which there exists comparable contemporaneous translations by R. Farqhuarson Sharp and William Archer. Consequently, a number of prose translations of Russian author Anton Chekhov by Marian Fell and Constance Garnett are examined in order to validate hypotheses formed from the results of the Ibsen study and investigate possible particularities in translator's style which may vary according toHighlights: We examine the stylistic fingerprint of two sets of literary translators. We use machine learning techniques such as SVMs, Nave Bayes and Simple Logistic Regression which are relatively novel approaches to the task. We find that common words and bigrams are strong discriminators of translators style based on two authors, who original wrote in Russian and Norwegian and were translated to English. We find that translators style remains discriminable across non-parallel translations of the same author. We also find that textual metrics such as common verb frequencies are strong indicators of translators style in both cases, in line with prior studies in the literature. Abstract: We detail the results of experiments towards a fine-grained stylometric analysis, the identification of distinguishing features between contemporaneous literary translations, both parallel works and also translations of non-parallel sets of works by the same author. We examine translations of plays by the Norwegian dramatist Henrik Ibsen with the initial point of focus being the Ibsen drama Ghosts, for which there exists comparable contemporaneous translations by R. Farqhuarson Sharp and William Archer. Consequently, a number of prose translations of Russian author Anton Chekhov by Marian Fell and Constance Garnett are examined in order to validate hypotheses formed from the results of the Ibsen study and investigate possible particularities in translator's style which may vary according to genre. By carrying out an analysis of these texts using a variety of machine learning approaches such as Support Vector Machines, Simple Logistic Regression, Naïve Bayes and Decision Tree classifiers, a number of distinguishing textual features are obtained, and the relative frequency of these features in the texts are compared to their frequencies in reference corpora in order to establish which features can be attributed to stylistic choices by the translators themselves and which features may be due to influence from the source language or the topic or genre of a text. We also use the popular Delta metric from authorship attribution studies to investigate the clustering of texts based on most frequent words and a list of discriminatory terms learned in the supervised machine learning experiments. We find that common word unigrams and bigrams are the most salient features for translator fingerprinting across our two authors and four translators examined and are ultimately successful in our goal of classifying which text originated from a particular translator with accuracy measurements of over 90% on average. … (more)
- Is Part Of:
- Computer speech & language. Volume 52(2018)
- Journal:
- Computer speech & language
- Issue:
- Volume 52(2018)
- Issue Display:
- Volume 52, Issue 2018 (2018)
- Year:
- 2018
- Volume:
- 52
- Issue:
- 2018
- Issue Sort Value:
- 2018-0052-2018-0000
- Page Start:
- 79
- Page End:
- 104
- Publication Date:
- 2018-11
- Subjects:
- Stylometry -- Authorship -- Translation
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2018.05.002 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 17055.xml