Back to Search Start Over

A benchmark dataset and evaluation methodology for Chinese zero pronoun translation.

Authors :
Xu, Mingzhou
Wang, Longyue
Liu, Siyou
Wong, Derek F.
Shi, Shuming
Tu, Zhaopeng
Source :
Language Resources & Evaluation; Sep2023, Vol. 57 Issue 3, p1263-1293, 31p
Publication Year :
2023

Abstract

The phenomenon of zero pronoun (ZP) has attracted increasing interest in the machine translation community due to its importance and difficulty. However, previous studies generally evaluate the quality of translating ZPs with BLEU score on MT testsets, which is not expressive or sensitive enough for accurate assessment. To bridge the data and evaluation gaps, we propose a benchmark testset and evaluation metric for target evaluation on Chinese ZP translation. The human-annotated testset covers five challenging genres, which reveal different characteristics of ZPs for comprehensive evaluation. We systematically revisit advanced models on ZP translation and identify current challenges for future exploration. We release data, code, and trained models, which we hope can significantly promote research in this field. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
1574020X
Volume :
57
Issue :
3
Database :
Complementary Index
Journal :
Language Resources & Evaluation
Publication Type :
Academic Journal
Accession number :
170029269
Full Text :
https://doi.org/10.1007/s10579-023-09660-5