Successful multimodal search and retrieval requires the automatic understanding of semantic cross-modal relations, which, however, is still an open research problem. Previous work has proposed the metrics of cross-modal mutual information and semantic correlation to model and predict semantic relations between image and text. In this paper, we present an approach to predict the (cross-modal) relative abstractness level of a given image-text pair, that is, whether the image is an abstraction of the text or vice versa. For this purpose, we introduce a new metric, the Abstractness Level (ABS), which captures this specific relationship between image and text. We present a deep learning approach to predict this metric; it relies on an autoencoder architecture that significantly reduces the amount of labeled training data required. To train and evaluate the approach, we have gathered a comprehensive set of publicly available scientific documents. Experimental results on a challenging test set demonstrate the feasibility of the approach.