Back to Search
Start Over
Ensemble Distilling Pretrained Language Models for Machine Translation Quality Estimation
- Source :
- Natural Language Processing and Chinese Computing ISBN: 9783030604561, NLPCC (2)
- Publication Year :
- 2020
- Publisher :
- Springer International Publishing, 2020.
-
Abstract
- Machine translation quality estimation (Quality Estimation, QE) aims to evaluate the quality of machine translation automatically without golden reference. QE can be implemented on different granularities, thus to give an estimation for different aspects of machines translation output. In this paper, we propose an effective method to utilize pretrained language models to improve the performance of QE. Our model combines two popular pretrained models, which are Bert and XLM, to create a very strong baseline for both sentence-level and word-level QE. We also propose a simple yet effective strategy, ensemble distillation, to further improve the accuracy of QE system. Ensemble distillation can integrate different knowledge from multiple models into one model, and strengthen each single model by a large margin. We evaluate our system on CCMT2019 Chinese-English and English-Chinese QE dataset, which contains word-level and sentence-level subtasks. Experiment results show our model surpasses previous models to a large extend, demonstrating the effectiveness of our proposed method.
- Subjects :
- Machine translation
business.industry
Computer science
media_common.quotation_subject
02 engineering and technology
Translation (geometry)
computer.software_genre
Machine learning
03 medical and health sciences
0302 clinical medicine
Simple (abstract algebra)
Margin (machine learning)
030221 ophthalmology & optometry
0202 electrical engineering, electronic engineering, information engineering
Effective method
020201 artificial intelligence & image processing
Quality (business)
Artificial intelligence
Language model
business
Baseline (configuration management)
computer
Natural language processing
media_common
Subjects
Details
- ISBN :
- 978-3-030-60456-1
- ISBNs :
- 9783030604561
- Database :
- OpenAIRE
- Journal :
- Natural Language Processing and Chinese Computing ISBN: 9783030604561, NLPCC (2)
- Accession number :
- edsair.doi...........a5d3cf74724f19ee5882005bba912db7
- Full Text :
- https://doi.org/10.1007/978-3-030-60457-8_19