Back to Search Start Over

A linguistically motivated taxonomy for Machine Translation error analysis

Authors :
Rui Correia
Ângela Costa
Luísa Coheur
Tiago Luís
Wang Ling
Centro de Linguística da UNL (CLUNL)
Source :
Repositório Científico de Acesso Aberto de Portugal, Repositório Científico de Acesso Aberto de Portugal (RCAAP), instacron:RCAAP
Publication Year :
2015

Abstract

UID/LIN/03213/2013 SFRH/BD/85737/2012 SFRH/BD/51157/2010 SFRH/BD/51156/2010 A detailed error analysis is a fundamental step in every natural lan- guage processing task, as to be able to diagnosis what went wrong will provide cues to decide which are the research directions to be followed. In this paper we focus on error analysis in Machine Translation. We deeply extend previous error taxonomies so that translation errors associated with Romance languages speci- ficities can be accommodated. Also, based on the proposed taxonomy, we carry out an extensive analysis of the errors generated by four di↵erent systems: two mainstream online translation systems Google Translate (Statistical) and Systran (Hybrid Machine Translation) and two in-house Machine Translation systems, in three scenarios representing di↵erent challenges in the translation from English to European Portuguese. Additionally, we comment on how distinct error types di↵erently impact translation quality. publishersversion published

Details

Language :
English
Database :
OpenAIRE
Journal :
Repositório Científico de Acesso Aberto de Portugal, Repositório Científico de Acesso Aberto de Portugal (RCAAP), instacron:RCAAP
Accession number :
edsair.doi.dedup.....3ae260b707e08c939af1975db1dfed29