51. Backretrieval: An Image-Pivoted Evaluation Metric for Cross-Lingual Text Representations Without Parallel Corpora
- Author
-
Niall Twomey, Mikhail Fain, Danushka Bollegala, Diaz, Fernando, Shah, Chirag, Suel, Torsten, Castells, Pablo, Jones, Rosie, and Sakai, Tetsuya
- Subjects
FOS: Computer and information sciences ,Ground truth ,Computer Science - Computation and Language ,Machine translation ,business.industry ,Computer science ,media_common.quotation_subject ,computer.software_genre ,Computer Science - Information Retrieval ,Image (mathematics) ,Parallel language ,Metric (mathematics) ,Embedding ,Quality (business) ,Artificial intelligence ,Proxy (statistics) ,business ,Computation and Language (cs.CL) ,computer ,Information Retrieval (cs.IR) ,Natural language processing ,media_common - Abstract
Cross-lingual text representations have gained popularity lately and act as the backbone of many tasks such as unsupervised machine translation and cross-lingual information retrieval, to name a few. However, evaluation of such representations is difficult in the domains beyond standard benchmarks due to the necessity of obtaining domain-specific parallel language data across different pairs of languages. In this paper, we propose an automatic metric for evaluating the quality of cross-lingual textual representations using images as a proxy in a paired image-text evaluation dataset. Experimentally, Backretrieval is shown to highly correlate with ground truth metrics on annotated datasets, and our analysis shows statistically significant improvements over baselines. Our experiments conclude with a case study on a recipe dataset without parallel cross-lingual data. We illustrate how to judge cross-lingual embedding quality with Backretrieval, and validate the outcome with a small human study., Comment: SIGIR 2021
- Published
- 2021