Kolja Bauer

CleanDIFT: Diffusion Features without Noise

Dec 04, 2024

Nick Stracke, Stefan Andreas Baumann, Kolja Bauer, Frank Fundel, Björn Ommer

Abstract: Internal features from large-scale pre-trained diffusion models have recently been established as powerful semantic descriptors for a wide range of downstream tasks. Works that use these features generally need to add noise to images before passing them through the model to obtain the semantic features, as the models do not offer the most useful features when given images with little to no noise. We show that this noise has a critical impact on the usefulness of these features that cannot be remedied by ensembling with different random noises. We address this issue by introducing a lightweight, unsupervised fine-tuning method that enables diffusion backbones to provide high-quality, noise-free semantic features. We show that these features readily outperform previous diffusion features by a wide margin in a wide variety of extraction setups and downstream tasks, offering better performance than even ensemble-based methods at a fraction of the cost.
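
To make the extraction setups in the abstract concrete, here is a minimal sketch contrasting a noised, ensembled feature pass with a single noise-free pass. It is not the authors' method or code. The DDPM-style noising formula is an assumption (the abstract does not name a noise schedule), and `toy_features`, `weights`, `alpha_bar_t`, and `n_draws` are hypothetical stand-ins for a real diffusion backbone and its settings.

```python
import math
import random

def add_noise(x0, alpha_bar_t, rng):
    """Assumed DDPM-style forward noising (not stated in the abstract):
    x_t = sqrt(alpha_bar_t) * x0 + sqrt(1 - alpha_bar_t) * eps, eps ~ N(0, 1)."""
    signal = math.sqrt(alpha_bar_t)
    sigma = math.sqrt(1.0 - alpha_bar_t)
    return [signal * v + sigma * rng.gauss(0.0, 1.0) for v in x0]

def toy_features(x, weights):
    """Hypothetical stand-in for a backbone's internal features: tanh(W x)."""
    return [math.tanh(sum(w * v for w, v in zip(row, x))) for row in weights]

def noisy_ensemble_features(x0, weights, alpha_bar_t, n_draws, rng):
    """Average features over n_draws independent noise samples.
    Each draw costs one forward pass, so the cost grows linearly with n_draws."""
    total = [0.0] * len(weights)
    for _ in range(n_draws):
        feats = toy_features(add_noise(x0, alpha_bar_t, rng), weights)
        total = [t + f for t, f in zip(total, feats)]
    return [t / n_draws for t in total]

rng = random.Random(0)
image = [rng.uniform(-1.0, 1.0) for _ in range(48)]  # flattened toy "image"
weights = [[rng.gauss(0.0, 1.0) / math.sqrt(48) for _ in range(48)]
           for _ in range(16)]

# Noise-free extraction: a single forward pass on the clean image.
clean = toy_features(image, weights)

# Noised extraction with ensembling: eight forward passes, then an average.
ensembled = noisy_ensemble_features(image, weights, alpha_bar_t=0.5,
                                    n_draws=8, rng=rng)
```

The sketch only illustrates the cost difference the abstract points to: ensembling needs one forward pass per noise draw, while noise-free extraction needs one pass in total. The fine-tuning step that makes clean-image features useful is not modeled here.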

* For the project page and code, see https://compvis.github.io/CleanDIFT/
