Title: Viewpoint Consistency in 3D Generation via Attention and CLIP Guidance

Dec 03, 2024

Qing Zhang, Zehao Chen, Jinguang Tong, Jing Zhang, Jie Hong, Xuesong Li

Abstract: Despite recent advances in text-to-3D generation techniques, current methods often suffer from geometric inconsistencies, commonly referred to as the Janus Problem. This paper identifies the root cause of the Janus Problem: viewpoint generation bias in diffusion models, which creates a significant gap between the actual generated viewpoint and the expected one required for optimizing the 3D model. To address this issue, we propose a tuning-free approach called the Attention and CLIP Guidance (ACG) mechanism. ACG enhances desired viewpoints by adaptively controlling cross-attention maps, employs CLIP-based view-text similarities to filter out erroneous viewpoints, and uses a coarse-to-fine optimization strategy with staged prompts to progressively refine 3D generation. Extensive experiments demonstrate that our method significantly reduces the Janus Problem without compromising generation speed, establishing ACG as an efficient, plug-and-play component for existing text-to-3D frameworks.
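
To make the idea of filtering viewpoints by view-text similarity more concrete, here is a minimal sketch. It is not the authors' implementation: the abstract does not give the filtering rule, so the argmax criterion, the function names `cosine` and `keep_view`, and the three-dimensional toy vectors standing in for CLIP image and text embeddings are all assumptions made for illustration.

```python
import math
from typing import Dict, Sequence


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def keep_view(
    image_emb: Sequence[float],
    text_embs: Dict[str, Sequence[float]],
    expected_view: str,
) -> bool:
    """Keep an image only if its best-matching view prompt is the expected one.

    Assumed rule (hypothetical): score the image against every view prompt
    and reject it when the highest-scoring view differs from the one the
    camera pose calls for.
    """
    scores = {view: cosine(image_emb, emb) for view, emb in text_embs.items()}
    best_view = max(scores, key=scores.get)
    return best_view == expected_view


# Toy stand-ins for embeddings of "front view", "side view", "back view" prompts.
text_embs = {
    "front": [1.0, 0.0, 0.0],
    "side": [0.0, 1.0, 0.0],
    "back": [0.0, 0.0, 1.0],
}

# Toy stand-in for the embedding of a generated image that looks frontal.
image_emb = [0.9, 0.2, 0.1]

front_ok = keep_view(image_emb, text_embs, "front")  # image matches the requested view
back_ok = keep_view(image_emb, text_embs, "back")    # frontal image while a back view is expected
```

In a real pipeline the toy vectors would be replaced by embeddings from an actual CLIP model, and a rejected image would presumably be skipped or down-weighted during optimization; how ACG handles that step is not stated in the abstract.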
