Back to Search
Start Over
Scalable Multi-grained Cross-modal Similarity Query with Interpretability
- Source :
- Data Science and Engineering. 6:280-293
- Publication Year :
- 2021
- Publisher :
- Springer Science and Business Media LLC, 2021.
-
Abstract
- Cross-modal similarity query has become a highlighted research topic for managing multimodal datasets such as images and texts. Existing researches generally focus on query accuracy by designing complex deep neural network models and hardly consider query efficiency and interpretability simultaneously, which are vital properties of cross-modal semantic query processing system on large-scale datasets. In this work, we investigate multi-grained common semantic embedding representations of images and texts and integrate interpretable query index into the deep neural network by developing a novel Multi-grained Cross-modal Query with Interpretability (MCQI) framework. The main contributions are as follows: (1) By integrating coarse-grained and fine-grained semantic learning models, a multi-grained cross-modal query processing architecture is proposed to ensure the adaptability and generality of query processing. (2) In order to capture the latent semantic relation between images and texts, the framework combines LSTM and attention mode, which enhances query accuracy for the cross-modal query and constructs the foundation for interpretable query processing. (3) Index structure and corresponding nearest neighbor query algorithm are proposed to boost the efficiency of interpretable queries. (4) A distributed query algorithm is proposed to improve the scalability of our framework. Comparing with state-of-the-art methods on widely used cross-modal datasets, the experimental results show the effectiveness of our MCQI approach.
- Subjects :
- Structure (mathematical logic)
Semantic query
Artificial neural network
Computer science
Computational Mechanics
02 engineering and technology
computer.software_genre
Computer Science Applications
k-nearest neighbors algorithm
Modal
020204 information systems
Scalability
0202 electrical engineering, electronic engineering, information engineering
Embedding
020201 artificial intelligence & image processing
Data mining
computer
Interpretability
Subjects
Details
- ISSN :
- 23641541 and 23641185
- Volume :
- 6
- Database :
- OpenAIRE
- Journal :
- Data Science and Engineering
- Accession number :
- edsair.doi...........c921c311ebe09fb6ef392051305c1beb
- Full Text :
- https://doi.org/10.1007/s41019-021-00162-4