Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution

May 29, 2022

Xintong Yu, Hongming Zhang, Ruixin Hong, Yangqiu Song, Changshui Zhang

Figure 1 for VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution

Figure 2 for VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution

Figure 3 for VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution

Figure 4 for VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution

Share this with someone who'll enjoy it:

Abstract:The visual dialog task requires an AI agent to interact with humans in multi-round dialogs based on a visual environment. As a common linguistic phenomenon, pronouns are often used in dialogs to improve the communication efficiency. As a result, resolving pronouns (i.e., grounding pronouns to the noun phrases they refer to) is an essential step towards understanding dialogs. In this paper, we propose VD-PCR, a novel framework to improve Visual Dialog understanding with Pronoun Coreference Resolution in both implicit and explicit ways. First, to implicitly help models understand pronouns, we design novel methods to perform the joint training of the pronoun coreference resolution and visual dialog tasks. Second, after observing that the coreference relationship of pronouns and their referents indicates the relevance between dialog rounds, we propose to explicitly prune the irrelevant history rounds in visual dialog models' input. With pruned input, the models can focus on relevant dialog history and ignore the distraction in the irrelevant one. With the proposed implicit and explicit methods, VD-PCR achieves state-of-the-art experimental results on the VisDial dataset.

* Pattern Recognition, 125, 108540 (2022) * The manuscript version of the paper. The published version is available at https://doi.org/10.1016/j.patcog.2022.108540 . The data, code and models are available at: https://github.com/HKUST- KnowComp/VD-PCR

View paper on

Share this with someone who'll enjoy it:

Title:VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution

Paper and Code