Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Unveiling Concept Attribution in Diffusion Models

Dec 03, 2024

Quang H. Nguyen, Hoang Phan, Khoa D. Doan

Figure 1 for Unveiling Concept Attribution in Diffusion Models

Figure 2 for Unveiling Concept Attribution in Diffusion Models

Figure 3 for Unveiling Concept Attribution in Diffusion Models

Figure 4 for Unveiling Concept Attribution in Diffusion Models

Share this with someone who'll enjoy it:

Abstract:Diffusion models have shown remarkable abilities in generating realistic and high-quality images from text prompts. However, a trained model remains black-box; little do we know about the role of its components in exhibiting a concept such as objects or styles. Recent works employ causal tracing to localize layers storing knowledge in generative models without showing how those layers contribute to the target concept. In this work, we approach the model interpretability problem from a more general perspective and pose a question: \textit{``How do model components work jointly to demonstrate knowledge?''}. We adapt component attribution to decompose diffusion models, unveiling how a component contributes to a concept. Our framework allows effective model editing, in particular, we can erase a concept from diffusion models by removing positive components while remaining knowledge of other concepts. Surprisingly, we also show there exist components that contribute negatively to a concept, which has not been discovered in the knowledge localization approach. Experimental results confirm the role of positive and negative components pinpointed by our framework, depicting a complete view of interpreting generative models. Our code is available at \url{https://github.com/mail-research/CAD-attribution4diffusion}

View paper on

Share this with someone who'll enjoy it:

Title:Unveiling Concept Attribution in Diffusion Models

Paper and Code