Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Customized Image Narrative Generation via Interactive Visual Question Generation and Answering

Apr 27, 2018

Andrew Shin, Yoshitaka Ushiku, Tatsuya Harada

Figure 1 for Customized Image Narrative Generation via Interactive Visual Question Generation and Answering

Figure 2 for Customized Image Narrative Generation via Interactive Visual Question Generation and Answering

Figure 3 for Customized Image Narrative Generation via Interactive Visual Question Generation and Answering

Figure 4 for Customized Image Narrative Generation via Interactive Visual Question Generation and Answering

Share this with someone who'll enjoy it:

Abstract:Image description task has been invariably examined in a static manner with qualitative presumptions held to be universally applicable, regardless of the scope or target of the description. In practice, however, different viewers may pay attention to different aspects of the image, and yield different descriptions or interpretations under various contexts. Such diversity in perspectives is difficult to derive with conventional image description techniques. In this paper, we propose a customized image narrative generation task, in which the users are interactively engaged in the generation process by providing answers to the questions. We further attempt to learn the user's interest via repeating such interactive stages, and to automatically reflect the interest in descriptions for new images. Experimental results demonstrate that our model can generate a variety of descriptions from single image that cover a wider range of topics than conventional models, while being customizable to the target user of interaction.

* To Appear at CVPR 2018 as spotlight presentation

View paper on

Share this with someone who'll enjoy it:

Title:Customized Image Narrative Generation via Interactive Visual Question Generation and Answering

Paper and Code