Computing author intent from multimodal data like Instagram posts requires modeling a complex relationship between text and image. For example, a caption might reflect ironically on the image, so neither the caption nor the image is a mere transcript of the other. Instead they combine -- via what has been called meaning multiplication -- to create a new meaning that has a more complex relation to the literal meanings of the text and image. Here we introduce a multimodal dataset of 1,299 Instagram posts labeled for three orthogonal taxonomies: the authorial intent behind the image-caption pair, the contextual relationship between the literal meanings of the image and caption, and the semiotic relationship between the signified meanings of the image and caption. We build a baseline deep multimodal classifier to validate the taxonomy, showing that employing both text and image improves intent detection by 8% compared to using only the image modality, demonstrating the commonality of non-intersective meaning multiplication. Our dataset offers an important resource for the study of the rich meanings that result from pairing text and image.
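
As a rough illustration of the kind of multimodal baseline described above, the sketch below shows a simple late-fusion classifier that encodes the caption and the image separately, concatenates the two feature vectors, and predicts an intent label. The module names, feature dimensions, and number of intent classes are illustrative assumptions, not the paper's actual architecture.

```python
# A minimal late-fusion sketch (illustrative only, not the paper's exact model):
# caption and image features are encoded separately, concatenated, and mapped
# to intent logits. All dimensions and the class count are assumptions.
import torch
import torch.nn as nn


class MultimodalIntentClassifier(nn.Module):
    def __init__(self, vocab_size=10000, text_dim=256, image_dim=2048,
                 hidden_dim=512, num_intents=8):
        super().__init__()
        # Caption encoder: embedding + mean pooling (a stand-in for any text encoder)
        self.embed = nn.Embedding(vocab_size, text_dim, padding_idx=0)
        # Image features are assumed to come pre-extracted from a pretrained CNN
        self.fuse = nn.Sequential(
            nn.Linear(text_dim + image_dim, hidden_dim),
            nn.ReLU(),
            nn.Dropout(0.3),
            nn.Linear(hidden_dim, num_intents),
        )

    def forward(self, caption_ids, image_feats):
        # caption_ids: (batch, seq_len) token ids; image_feats: (batch, image_dim)
        text_feats = self.embed(caption_ids).mean(dim=1)      # (batch, text_dim)
        joint = torch.cat([text_feats, image_feats], dim=-1)  # late fusion by concatenation
        return self.fuse(joint)                               # (batch, num_intents) intent logits


# Example usage with random inputs
model = MultimodalIntentClassifier()
logits = model(torch.randint(1, 10000, (4, 20)), torch.randn(4, 2048))
print(logits.shape)  # torch.Size([4, 8])
```

Dropping either input (e.g. passing zeroed caption features) turns this into a unimodal baseline, which is the comparison underlying the reported 8% improvement from using both modalities.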