Title: A Comprehensive Survey on Visual Concept Mining in Text-to-image Diffusion Models

Mar 17, 2025

Ziqiang Li, Jun Li, Lizhi Xiong, Zhangjie Fu, Zechao Li

Abstract: Text-to-image diffusion models have made significant advancements in generating high-quality, diverse images from text prompts. However, the inherent limitations of textual signals often prevent these models from fully capturing specific concepts, thereby reducing their controllability. To address this issue, several approaches have incorporated personalization techniques, utilizing reference images to mine visual concept representations that complement textual inputs and enhance the controllability of text-to-image diffusion models. Despite these advances, a comprehensive, systematic exploration of visual concept mining remains limited. In this paper, we categorize existing research into four key areas: Concept Learning, Concept Erasing, Concept Decomposition, and Concept Combination. This classification provides valuable insights into the foundational principles of Visual Concept Mining (VCM) techniques. Additionally, we identify key challenges and propose future research directions to propel this important and interesting field forward.

* Under review
