Back to Search
Start Over
A Principled Approach Using Fuzzy Set Theory for Passage-Based Document Retrieval
- Source :
- IEEE Transactions on Fuzzy Systems. 29:1967-1977
- Publication Year :
- 2021
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2021.
-
Abstract
- In this article, we present a novel principled approach to passage-based (document) retrieval using fuzzy set theory. The approach formulates passage score combination according to general relevance decision principles. By operationalizing these principles using aggregation operators of fuzzy set theory, our approach justifies the common heuristics of taking the maximum constituent passage score as the overall document score. Experiments show that this heuristics is only the near best, with some fuzzy set aggregation operators stipulated in our approach being better methods. The significance of our principled approach is the applicability of many passage score combination methods, potentially bringing further performance enhancement. Experiments on several text retrieval conference collections demonstrate that our approach performs significantly better than document-based retrieval. While recent works in the literature mostly employ document-based rather than passage-based retrieval due to the common conception that document length normalization solves the problem of varying document lengths, our results show that document length normalization alone is not sufficient, especially in pseudo-relevance feedback retrieval.
- Subjects :
- Normalization (statistics)
Operationalization
Computer science
business.industry
Applied Mathematics
InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL
Fuzzy set
computer.software_genre
Semantics
Computational Theory and Mathematics
Artificial Intelligence
Control and Systems Engineering
ComputingMethodologies_DOCUMENTANDTEXTPROCESSING
Relevance (information retrieval)
Artificial intelligence
Document retrieval
business
Heuristics
Text Retrieval Conference
computer
Natural language processing
Subjects
Details
- ISSN :
- 19410034 and 10636706
- Volume :
- 29
- Database :
- OpenAIRE
- Journal :
- IEEE Transactions on Fuzzy Systems
- Accession number :
- edsair.doi...........a738c7bc423dc0dfa5e22e2309553cd5
- Full Text :
- https://doi.org/10.1109/tfuzz.2020.2990110