Back to Search Start Over

On the Influence of Machine Translation on Language Origin Obfuscation

Authors :
Murauer, Benjamin
Tschuggnall, Michael
Specht, Günther
Murauer, Benjamin
Tschuggnall, Michael
Specht, Günther
Publication Year :
2021

Abstract

In the last decade, machine translation has become a popular means to deal with multilingual digital content. By providing higher quality translations, obfuscating the source language of a text becomes more attractive. In this paper, we analyze the ability to detect the source language from the translated output of two widely used commercial machine translation systems by utilizing machine-learning algorithms with basic textual features like n-grams. Evaluations show that the source language can be reconstructed with high accuracy for documents that contain a sufficient amount of translated text. In addition, we analyze how the document size influences the performance of the prediction, as well as how limiting the set of possible source languages improves the classification accuracy.<br />Comment: This was peer-reviewed, accepted and presented at https://www.cicling.org/2018/, but the organizer somehow failed to publish the proceedings

Details

Database :
OAIster
Publication Type :
Electronic Resource
Accession number :
edsoai.on1269560339
Document Type :
Electronic Resource