Back to Search
Start Over
'Is This an Example Image?' – Predicting the Relative Abstractness Level of Image and Text
- Source :
- Lecture Notes in Computer Science ISBN: 9783030157111, ECIR (1)
- Publication Year :
- 2019
- Publisher :
- Springer International Publishing, 2019.
-
Abstract
- Successful multimodal search and retrieval requires the automatic understanding of semantic cross-modal relations, which, however, is still an open research problem. Previous work has suggested the metrics cross-modal mutual information and semantic correlation to model and predict cross-modal semantic relations of image and text. In this paper, we present an approach to predict the (cross-modal) relative abstractness level of a given image-text pair, that is whether the image is an abstraction of the text or vice versa. For this purpose, we introduce a new metric that captures this specific relationship between image and text at the Abstractness Level (ABS). We present a deep learning approach to predict this metric, which relies on an autoencoder architecture that allows us to significantly reduce the required amount of labeled training data. A comprehensive set of publicly available scientific documents has been gathered. Experimental results on a challenging test set demonstrate the feasibility of the approach.
- Subjects :
- Multimodal search
Computer science
business.industry
Deep learning
02 engineering and technology
Mutual information
computer.software_genre
Autoencoder
Set (abstract data type)
03 medical and health sciences
0302 clinical medicine
Test set
Metric (mathematics)
030221 ophthalmology & optometry
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Artificial intelligence
business
computer
Natural language processing
Abstraction (linguistics)
Subjects
Details
- ISBN :
- 978-3-030-15711-1
- ISBNs :
- 9783030157111
- Database :
- OpenAIRE
- Journal :
- Lecture Notes in Computer Science ISBN: 9783030157111, ECIR (1)
- Accession number :
- edsair.doi...........763f1789a10f0c0c10e5b41c42e5003d