Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lanqing guo

Oscillation Inversion: Understand the structure of Large Flow Model through the Lens of Inversion Method

Nov 17, 2024

Yan Zheng, Zhenxiao Liang, Xiaoyan Cong, Lanqing guo, Yuehao Wang, Peihao Wang, Zhangyang Wang

Figure 1 for Oscillation Inversion: Understand the structure of Large Flow Model through the Lens of Inversion Method

Figure 2 for Oscillation Inversion: Understand the structure of Large Flow Model through the Lens of Inversion Method

Figure 3 for Oscillation Inversion: Understand the structure of Large Flow Model through the Lens of Inversion Method

Figure 4 for Oscillation Inversion: Understand the structure of Large Flow Model through the Lens of Inversion Method

Abstract:We explore the oscillatory behavior observed in inversion methods applied to large-scale text-to-image diffusion models, with a focus on the "Flux" model. By employing a fixed-point-inspired iterative approach to invert real-world images, we observe that the solution does not achieve convergence, instead oscillating between distinct clusters. Through both toy experiments and real-world diffusion models, we demonstrate that these oscillating clusters exhibit notable semantic coherence. We offer theoretical insights, showing that this behavior arises from oscillatory dynamics in rectified flow models. Building on this understanding, we introduce a simple and fast distribution transfer technique that facilitates image enhancement, stroke-based recoloring, as well as visual prompt-guided image editing. Furthermore, we provide quantitative results demonstrating the effectiveness of our method for tasks such as image enhancement, makeup transfer, reconstruction quality, and guided sampling quality. Higher-quality examples of videos and images are available at \href{https://yanyanzheng96.github.io/oscillation_inversion/}{this link}.

Via

Access Paper or Ask Questions