Back to Search Start Over

Explicit relevance models in intent-oriented information retrieval diversification

Authors :
Pablo Castells
Saúl Vargas
David Vallet
UAM. Departamento de Ingeniería Informática
Recuperación de información (ING EPS-008)
Source :
SIGIR, Biblos-e Archivo. Repositorio Institucional de la UAM, instname
Publication Year :
2012
Publisher :
ACM, 2012.

Abstract

This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval, http://dx.doi.org/10.1145/2348283.2348297.<br />The intent-oriented search diversification methods developed in the field so far tend to build on generative views of the retrieval system to be diversified. Core algorithm components in particular redundancy assessment are expressed in terms of the probability to observe documents, rather than the probability that the documents be relevant. This has been sometimes described as a view considering the selection of a single document in the underlying task model. In this paper we propose an alternative formulation of aspect-based diversification algorithms which explicitly includes a formal relevance model. We develop means for the effective computation of the new formulation, and we test the resulting algorithm empirically. We report experiments on search and recommendation tasks showing competitive or better performance than the original diversification algorithms. The relevance-based formulation has further interesting properties, such as unifying two well-known state of the art algorithms into a single version. The relevance-based approach opens alternative possibilities for further formal connections and developments as natural extensions of the framework. We illustrate this by modeling tolerance to redundancy as an explicit configurable parameter, which can be set to better suit the characteristics of the IR task, or the evaluation metrics, as we illustrate empirically.<br />This work was supported by the national Spanish projects TIN2011-28538-C02-01 and S2009TIC-1542.

Details

Database :
OpenAIRE
Journal :
Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Accession number :
edsair.doi.dedup.....fac60aac10a95e26460141811e462d76
Full Text :
https://doi.org/10.1145/2348283.2348297