Back to Search Start Over

mRobust04: A Multilingual Version of the TREC Robust 2004 Benchmark

Authors :
Jeronymo, Vitor
Nascimento, Mauricio
Lotufo, Roberto
Nogueira, Rodrigo
Publication Year :
2022

Abstract

Robust 2004 is an information retrieval benchmark whose large number of judgments per query make it a reliable evaluation dataset. In this paper, we present mRobust04, a multilingual version of Robust04 that was translated to 8 languages using Google Translate. We also provide results of three different multilingual retrievers on this dataset. The dataset is available at https://huggingface.co/datasets/unicamp-dl/mrobust<br />Comment: 4 pages

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2209.13738
Document Type :
Working Paper