Back to Search
Start Over
Instant Translation Model Adaptation by Translating Unseen Words in Continuous Vector Space
- Source :
- Computational Linguistics and Intelligent Text Processing ISBN: 9783319754864, CICLing (2)
- Publication Year :
- 2018
- Publisher :
- Springer International Publishing, 2018.
-
Abstract
- In statistical machine translation (smt), differences between domains of training and test data result in poor translations. Although there have been many studies on domain adaptation of language models and translation models, most require supervised in-domain language resources such as parallel corpora for training and tuning the models. The necessity of supervised data has made such methods difficult to adapt to practical smt systems. We thus propose a novel method that adapts translation models without in-domain parallel corpora. Our method infers translation candidates of unseen words by nearest-neighbor search after projecting their vector-based semantic representations to the semantic space of the target language. In our experiment of out-of-domain translation from Japanese to English, our method improved bleu score by 0.5–1.5.
- Subjects :
- Domain adaptation
Machine translation
business.industry
Computer science
02 engineering and technology
Translation (geometry)
computer.software_genre
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Language model
Artificial intelligence
business
Adaptation (computer science)
computer
Natural language processing
Instant
Test data
Vector space
Subjects
Details
- ISBN :
- 978-3-319-75486-4
- ISBNs :
- 9783319754864
- Database :
- OpenAIRE
- Journal :
- Computational Linguistics and Intelligent Text Processing ISBN: 9783319754864, CICLing (2)
- Accession number :
- edsair.doi...........594e8810c6b668486e12c448177a38b8