Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Learning to Select: A Fully Attentive Approach for Novel Object Captioning

Jun 02, 2021

Marco Cagrandi, Marcella Cornia, Matteo Stefanini, Lorenzo Baraldi, Rita Cucchiara

Figure 1 for Learning to Select: A Fully Attentive Approach for Novel Object Captioning

Figure 2 for Learning to Select: A Fully Attentive Approach for Novel Object Captioning

Figure 3 for Learning to Select: A Fully Attentive Approach for Novel Object Captioning

Figure 4 for Learning to Select: A Fully Attentive Approach for Novel Object Captioning

Share this with someone who'll enjoy it:

Abstract:Image captioning models have lately shown impressive results when applied to standard datasets. Switching to real-life scenarios, however, constitutes a challenge due to the larger variety of visual concepts which are not covered in existing training sets. For this reason, novel object captioning (NOC) has recently emerged as a paradigm to test captioning models on objects which are unseen during the training phase. In this paper, we present a novel approach for NOC that learns to select the most relevant objects of an image, regardless of their adherence to the training set, and to constrain the generative process of a language model accordingly. Our architecture is fully-attentive and end-to-end trainable, also when incorporating constraints. We perform experiments on the held-out COCO dataset, where we demonstrate improvements over the state of the art, both in terms of adaptability to novel objects and caption quality.

* ICMR 2021

View paper on

Share this with someone who'll enjoy it:

Title:Learning to Select: A Fully Attentive Approach for Novel Object Captioning

Paper and Code