Back to Search Start Over

Scalable Multi-grained Cross-modal Similarity Query with Interpretability

Authors :
Lixin Xu
Xianfang Wang
Mingdong Zhu
Derong Shen
Source :
Data Science and Engineering. 6:280-293
Publication Year :
2021
Publisher :
Springer Science and Business Media LLC, 2021.

Abstract

Cross-modal similarity query has become a highlighted research topic for managing multimodal datasets such as images and texts. Existing researches generally focus on query accuracy by designing complex deep neural network models and hardly consider query efficiency and interpretability simultaneously, which are vital properties of cross-modal semantic query processing system on large-scale datasets. In this work, we investigate multi-grained common semantic embedding representations of images and texts and integrate interpretable query index into the deep neural network by developing a novel Multi-grained Cross-modal Query with Interpretability (MCQI) framework. The main contributions are as follows: (1) By integrating coarse-grained and fine-grained semantic learning models, a multi-grained cross-modal query processing architecture is proposed to ensure the adaptability and generality of query processing. (2) In order to capture the latent semantic relation between images and texts, the framework combines LSTM and attention mode, which enhances query accuracy for the cross-modal query and constructs the foundation for interpretable query processing. (3) Index structure and corresponding nearest neighbor query algorithm are proposed to boost the efficiency of interpretable queries. (4) A distributed query algorithm is proposed to improve the scalability of our framework. Comparing with state-of-the-art methods on widely used cross-modal datasets, the experimental results show the effectiveness of our MCQI approach.

Details

ISSN :
23641541 and 23641185
Volume :
6
Database :
OpenAIRE
Journal :
Data Science and Engineering
Accession number :
edsair.doi...........c921c311ebe09fb6ef392051305c1beb
Full Text :
https://doi.org/10.1007/s41019-021-00162-4