Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Concept Algebra for Text-Controlled Vision Models

Feb 07, 2023

Zihao Wang, Lin Gui, Jeffrey Negrea, Victor Veitch

Figure 1 for Concept Algebra for Text-Controlled Vision Models

Figure 2 for Concept Algebra for Text-Controlled Vision Models

Figure 3 for Concept Algebra for Text-Controlled Vision Models

Figure 4 for Concept Algebra for Text-Controlled Vision Models

Share this with someone who'll enjoy it:

Abstract:This paper concerns the control of text-guided generative models, where a user provides a natural language prompt and the model generates samples based on this input. Prompting is intuitive, general, and flexible. However, there are significant limitations: prompting can fail in surprising ways, and it is often unclear how to find a prompt that will elicit some desired target behavior. A core difficulty for developing methods to overcome these issues is that failures are know-it-when-you-see-it -- it's hard to fix bugs if you can't state precisely what the model should have done! In this paper, we introduce a formalization of "what the user intended" in terms of latent concepts implicit to the data generating process that the model was trained on. This formalization allows us to identify some fundamental limitations of prompting. We then use the formalism to develop concept algebra to overcome these limitations. Concept algebra is a way of directly manipulating the concepts expressed in the output through algebraic operations on a suitably defined representation of input prompts. We give examples using concept algebra to overcome limitations of prompting, including concept transfer through arithmetic, and concept nullification through projection. Code available at https://github.com/zihao12/concept-algebra.

View paper on

Share this with someone who'll enjoy it:

Title:Concept Algebra for Text-Controlled Vision Models

Paper and Code