Translating noun compounds using semantic relations. (July 2015)
- Record Type:
- Journal Article
- Title:
- Translating noun compounds using semantic relations. (July 2015)
- Main Title:
- Translating noun compounds using semantic relations
- Authors:
- Balyan, Renu
Chatterjee, Niladri
- Abstract:
- Highlights: A hybrid scheme for noun compound translation from English to Hindi. A scheme for semantic relation identification is proposed for 2-word noun compounds. A scheme for generation of translation pattern(s) for 2/3/4-word noun compounds. Bracketing scheme for 4-word noun compounds. Results of the proposed system compared with the state-of-the-art translators.
Abstract: Despite having a research history of more than 20 years, English to Hindi machine translation often suffers badly from incorrect translations of noun compounds. The problems can be of various types, such as the absence of proper postpositions, inappropriate word order, and incorrect semantics. Existing English to Hindi machine translation systems show this vulnerability irrespective of the underlying technique. A potential solution to this problem lies in understanding the semantics of the noun compounds. The present paper proposes a scheme based on semantic relations to address this issue. The scheme works in three steps: identification of the noun compounds in a given text, determination of the semantic relationship(s) between their constituents, and finally, selection of the right translation pattern. The scheme first provides translation patterns for different semantic relations for 2-word noun compounds. These patterns are then used recursively to find the semantic relations and the translation patterns for 3-word and 4-word noun compounds. Frequency- and probability-based adjacency and dependency models are used for bracketing (grouping) the constituent words of 3-word and 4-word noun compounds into 2-word noun compounds. The semantic relations and the translation patterns generated for 2-word, 3-word and 4-word noun compounds are evaluated. The proposed scheme is compared with some well-known English to Hindi translators, viz. AnglaMT, Anuvadaksh, Bing, Google, and also with the Moses baseline system. The results obtained show significant improvement over the Moses baseline system. The scheme also performs better than the other online MT systems in terms of recall and precision.
- Is Part Of:
- Computer speech & language. Volume 32(2015)
- Journal:
- Computer speech & language
- Issue:
- Volume 32(2015)
- Issue Display:
- Volume 32, Issue 2015 (2015)
- Year:
- 2015
- Volume:
- 32
- Issue:
- 2015
- Issue Sort Value:
- 2015-0032-2015-0000
- Page Start:
- 91
- Page End:
- 108
- Publication Date:
- 2015-07
- Subjects:
- Noun compounds -- Semantic relation -- Translation pattern -- Bracketing -- Machine translation
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
- Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.csl.2014.09.007
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 5431.xml
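
Note on the bracketing step named in the abstract: the record does not give the paper's formulas, so the following is only a minimal sketch of how adjacency and dependency models are commonly formulated for 3-word noun compounds. It assumes a simple pair-frequency association score. The counts in `PAIR_COUNTS`, the function names `assoc` and `bracket`, and the tie-breaking rule (prefer left bracketing) are all hypothetical and are not taken from the paper.

```python
from collections import Counter

# Hypothetical toy pair counts, invented for illustration only.
# They are NOT data from the paper described in this record.
PAIR_COUNTS = Counter({
    ("water", "tank"): 50,
    ("tank", "cleaner"): 5,
    ("water", "cleaner"): 2,
    ("plastic", "water"): 1,
    ("water", "bottle"): 40,
    ("plastic", "bottle"): 30,
})


def assoc(w1, w2):
    """Association score of an ordered word pair (raw count; 0 if unseen)."""
    return PAIR_COUNTS[(w1, w2)]


def bracket(w1, w2, w3, model="adjacency"):
    """Group a 3-word compound into nested 2-word compounds.

    adjacency:  compare assoc(w1, w2) with assoc(w2, w3)
    dependency: compare assoc(w1, w2) with assoc(w1, w3)
    Ties go to left bracketing (an assumption of this sketch).
    """
    left_score = assoc(w1, w2)
    if model == "adjacency":
        right_score = assoc(w2, w3)
    elif model == "dependency":
        right_score = assoc(w1, w3)
    else:
        raise ValueError(f"unknown model: {model}")

    if left_score >= right_score:
        return ((w1, w2), w3)   # left bracketing: [[w1 w2] w3]
    return (w1, (w2, w3))       # right bracketing: [w1 [w2 w3]]


if __name__ == "__main__":
    print(bracket("water", "tank", "cleaner", model="adjacency"))
    print(bracket("plastic", "water", "bottle", model="dependency"))
```

Each nested pair produced this way is a 2-word noun compound, which is the unit the abstract says the semantic relations and translation patterns are defined on; a 4-word compound would need the same comparison applied over more candidate groupings.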