Title: Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding

Sep 09, 2024

Bram Willemsen, Gabriel Skantze

Abstract: We propose an approach to referring expression generation (REG) in visually grounded dialogue that is meant to produce referring expressions (REs) that are both discriminative and discourse-appropriate. Our method constitutes a two-stage process. First, we model REG as a text- and image-conditioned next-token prediction task. REs are autoregressively generated based on their preceding linguistic context and a visual representation of the referent. Second, we propose the use of discourse-aware comprehension guiding as part of a generate-and-rerank strategy through which candidate REs generated with our REG model are reranked based on their discourse-dependent discriminatory power. Results from our human evaluation indicate that our proposed two-stage approach is effective in producing discriminative REs, with higher performance in terms of text-image retrieval accuracy for reranked REs compared to those generated using greedy decoding.

* Accepted for publication at INLG 2024
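
The generate-and-rerank idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function `rerank`, the `toy_scorer`, and the `TOY_IMAGES` descriptions are hypothetical stand-ins. In particular, the toy scorer replaces a real discourse-aware comprehension model with simple word overlap between the dialogue context plus candidate RE and a short text description of each image. The sketch only shows the reranking step: each candidate RE is ordered by how much of the comprehension score mass falls on the intended referent.

```python
from typing import Callable, List, Sequence

# Hypothetical scorer signature (an assumption, not from the paper):
# (dialogue context, candidate RE, number of images) -> one score per image.
Scorer = Callable[[str, str, int], Sequence[float]]

# Toy stand-in for the candidate images: short text descriptions.
TOY_IMAGES = ["brown dog on grass", "brown dog on sofa", "black cat on sofa"]


def toy_scorer(context: str, re_text: str, num_images: int) -> List[float]:
    """Toy comprehension scorer: word overlap between (context + RE) and each image."""
    words = set((context + " " + re_text).lower().split())
    # Add 1.0 so every image keeps a non-zero score.
    return [1.0 + len(words & set(desc.split())) for desc in TOY_IMAGES[:num_images]]


def rerank(
    context: str,
    candidates: Sequence[str],
    target_index: int,
    num_images: int,
    scorer: Scorer,
) -> List[str]:
    """Order candidate REs by the normalized score the scorer assigns to the target image."""
    scored = []
    for re_text in candidates:
        scores = scorer(context, re_text, num_images)
        total = sum(scores)
        # Share of the total score that lands on the intended referent.
        p_target = scores[target_index] / total if total > 0 else 0.0
        scored.append((p_target, re_text))
    # Highest target share first; the sort is stable for ties.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [re_text for _, re_text in scored]


candidates = ["the dog", "the brown dog on the sofa", "the one on the sofa"]
ranking = rerank("", candidates, target_index=1, num_images=3, scorer=toy_scorer)
print(ranking[0])
```

In a real system the candidates would come from sampling the REG model, and the scorer would be a model that resolves an RE to an image given the preceding dialogue. Because the scorer receives the context, the same RE can rank differently depending on what has already been said.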
