Back to Search
Start Over
Unsupervised Visual and Textual Information Fusion in CBMIR Using Graph-Based Methods
- Source :
- ACM Transactions on Information Systems. 33:1-31
- Publication Year :
- 2015
- Publisher :
- Association for Computing Machinery (ACM), 2015.
-
Abstract
- Multimedia collections are more than ever growing in size and diversity. Effective multimedia retrieval systems are thus critical to access these datasets from the end-user perspective and in a scalable way. We are interested in repositories of image/text multimedia objects and we study multimodal information fusion techniques in the context of content-based multimedia information retrieval. We focus on graph-based methods, which have proven to provide state-of-the-art performances. We particularly examine two such methods: cross-media similarities and random-walk-based scores. From a theoretical viewpoint, we propose a unifying graph-based framework, which encompasses the two aforementioned approaches. Our proposal allows us to highlight the core features one should consider when using a graph-based technique for the combination of visual and textual information. We compare cross-media and random-walk-based results using three different real-world datasets. From a practical standpoint, our extended empirical analyses allow us to provide insights and guidelines about the use of graph-based methods for multimodal information fusion in content-based multimedia information retrieval.
Details
- ISSN :
- 15582868 and 10468188
- Volume :
- 33
- Database :
- OpenAIRE
- Journal :
- ACM Transactions on Information Systems
- Accession number :
- edsair.doi...........315e40bbad2a2c3aec8eb8bc492177fd
- Full Text :
- https://doi.org/10.1145/2699668