Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:OmniControlNet: Dual-stage Integration for Conditional Image Generation

Jun 09, 2024

Yilin Wang, Haiyang Xu, Xiang Zhang, Zeyuan Chen, Zhizhou Sha, Zirui Wang, Zhuowen Tu

Figure 1 for OmniControlNet: Dual-stage Integration for Conditional Image Generation

Figure 2 for OmniControlNet: Dual-stage Integration for Conditional Image Generation

Figure 3 for OmniControlNet: Dual-stage Integration for Conditional Image Generation

Figure 4 for OmniControlNet: Dual-stage Integration for Conditional Image Generation

Share this with someone who'll enjoy it:

Abstract:We provide a two-way integration for the widely adopted ControlNet by integrating external condition generation algorithms into a single dense prediction method and incorporating its individually trained image generation processes into a single model. Despite its tremendous success, the ControlNet of a two-stage pipeline bears limitations in being not self-contained (e.g. calls the external condition generation algorithms) with a large model redundancy (separately trained models for different types of conditioning inputs). Our proposed OmniControlNet consolidates 1) the condition generation (e.g., HED edges, depth maps, user scribble, and animal pose) by a single multi-tasking dense prediction algorithm under the task embedding guidance and 2) the image generation process for different conditioning types under the textual embedding guidance. OmniControlNet achieves significantly reduced model complexity and redundancy while capable of producing images of comparable quality for conditioned text-to-image generation.

* Accepted to CVPR 2024 Workshop: Generative Models for Computer Vision

View paper on

Share this with someone who'll enjoy it:

Title:OmniControlNet: Dual-stage Integration for Conditional Image Generation

Paper and Code