Title: TreePrompt: Learning to Compose Tree Prompts for Explainable Visual Grounding

May 19, 2023

Chenchi Zhang, Jun Xiao, Lei Chen, Jian Shao, Long Chen

Abstract: Prompt tuning has achieved great success in transferring knowledge from large pretrained vision-language models to downstream tasks, and now dominates performance on visual grounding (VG). However, almost all existing prompt tuning paradigms suffer from poor interpretability. In this paper, we argue that this poor interpretability stems from their holistic prompt generation and inference process. By "holistic", we mean that they usually learn a set of vectors directly as the prompt (i.e., prompt generation) and use the learned global prompt to augment the textual input for the VG model (i.e., prompt inference). To address this, we propose a new prompt construction paradigm with explicit explainability, named TreePrompt. Specifically, we first deconstruct a complex sentence into a tree that is consistent with human reasoning. Then, following the syntax tree, we compose a structured prompt in a bottom-up manner. Thanks to this step-by-step prompt construction process, each intermediate prompt (i.e., tree node) lets us understand the reasoning process. Extensive ablations on various backbones and benchmarks consistently demonstrate the effectiveness and interpretability of TreePrompt.
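
The bottom-up composition described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: `TreeNode`, `toy_embed`, `compose`, and the mean-based fusion rule are all hypothetical names and assumptions chosen for illustration, and the example tree is hand-written rather than produced by a real parser. The sketch only shows the general idea that each tree node yields an intermediate prompt that can be inspected.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

Vector = List[float]


@dataclass
class TreeNode:
    """One word of the parsed sentence plus its syntactic children."""
    word: str
    children: List["TreeNode"] = field(default_factory=list)


def toy_embed(word: str, dim: int = 4) -> Vector:
    """Hypothetical stand-in for a learned word embedding (deterministic)."""
    seed = sum(ord(ch) for ch in word)
    return [((seed * (i + 1)) % 11) / 10.0 for i in range(dim)]


def compose(node: TreeNode, trace: List[Tuple[str, Vector]]) -> Vector:
    """Build the prompt for `node` from its children's prompts (post-order)."""
    parts = [toy_embed(node.word)]
    for child in node.children:
        parts.append(compose(child, trace))
    # Assumed fusion rule: element-wise mean of the node's own embedding
    # and the prompts already composed for its children.
    prompt = [sum(vals) / len(parts) for vals in zip(*parts)]
    # Record the intermediate prompt so every step can be inspected later.
    trace.append((node.word, prompt))
    return prompt


# Hand-written tree for "woman holding umbrella" (an assumed parse).
tree = TreeNode("woman", [TreeNode("holding", [TreeNode("umbrella")])])
trace: List[Tuple[str, Vector]] = []
root_prompt = compose(tree, trace)

for word, prompt in trace:
    print(word, [round(x, 2) for x in prompt])
```

The `trace` list holds one entry per tree node in the order the prompts were built, leaves first and root last, which is the kind of step-by-step record that a holistic, single-vector prompt does not provide.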
