Back to Search Start Over

Instant Translation Model Adaptation by Translating Unseen Words in Continuous Vector Space

Authors :
Naoki Yoshinaga
Shonosuke Ishiwatari
Masaru Kitsuregawa
Masashi Toyoda
Source :
Computational Linguistics and Intelligent Text Processing ISBN: 9783319754864, CICLing (2)
Publication Year :
2018
Publisher :
Springer International Publishing, 2018.

Abstract

In statistical machine translation (smt), differences between domains of training and test data result in poor translations. Although there have been many studies on domain adaptation of language models and translation models, most require supervised in-domain language resources such as parallel corpora for training and tuning the models. The necessity of supervised data has made such methods difficult to adapt to practical smt systems. We thus propose a novel method that adapts translation models without in-domain parallel corpora. Our method infers translation candidates of unseen words by nearest-neighbor search after projecting their vector-based semantic representations to the semantic space of the target language. In our experiment of out-of-domain translation from Japanese to English, our method improved bleu score by 0.5–1.5.

Details

ISBN :
978-3-319-75486-4
ISBNs :
9783319754864
Database :
OpenAIRE
Journal :
Computational Linguistics and Intelligent Text Processing ISBN: 9783319754864, CICLing (2)
Accession number :
edsair.doi...........594e8810c6b668486e12c448177a38b8