Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yingpeng Zhang

GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation

Sep 27, 2024

Jiawei Lu, Yingpeng Zhang, Zengjun Zhao, He Wang, Kun Zhou, Tianjia Shao

Figure 1 for GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation

Figure 2 for GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation

Figure 3 for GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation

Figure 4 for GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation

Abstract:Large-scale text-guided image diffusion models have shown astonishing results in text-to-image (T2I) generation. However, applying these models to synthesize textures for 3D geometries remains challenging due to the domain gap between 2D images and textures on a 3D surface. Early works that used a projecting-and-inpainting approach managed to preserve generation diversity but often resulted in noticeable artifacts and style inconsistencies. While recent methods have attempted to address these inconsistencies, they often introduce other issues, such as blurring, over-saturation, or over-smoothing. To overcome these challenges, we propose a novel text-to-texture synthesis framework that leverages pretrained diffusion models. We first introduce a local attention reweighing mechanism in the self-attention layers to guide the model in concentrating on spatial-correlated patches across different views, thereby enhancing local details while preserving cross-view consistency. Additionally, we propose a novel latent space merge pipeline, which further ensures consistency across different viewpoints without sacrificing too much diversity. Our method significantly outperforms existing state-of-the-art techniques regarding texture consistency and visual quality, while delivering results much faster than distillation-based methods. Importantly, our framework does not require additional training or fine-tuning, making it highly adaptable to a wide range of models available on public platforms.

Via

Access Paper or Ask Questions

GenesisTex: Adapting Image Denoising Diffusion to Texture Space

Mar 26, 2024

Chenjian Gao, Boyan Jiang, Xinghui Li, Yingpeng Zhang, Qian Yu

Figure 1 for GenesisTex: Adapting Image Denoising Diffusion to Texture Space

Figure 2 for GenesisTex: Adapting Image Denoising Diffusion to Texture Space

Figure 3 for GenesisTex: Adapting Image Denoising Diffusion to Texture Space

Figure 4 for GenesisTex: Adapting Image Denoising Diffusion to Texture Space

Abstract:We present GenesisTex, a novel method for synthesizing textures for 3D geometries from text descriptions. GenesisTex adapts the pretrained image diffusion model to texture space by texture space sampling. Specifically, we maintain a latent texture map for each viewpoint, which is updated with predicted noise on the rendering of the corresponding viewpoint. The sampled latent texture maps are then decoded into a final texture map. During the sampling process, we focus on both global and local consistency across multiple viewpoints: global consistency is achieved through the integration of style consistency mechanisms within the noise prediction network, and low-level consistency is achieved by dynamically aligning latent textures. Finally, we apply reference-based inpainting and img2img on denser views for texture refinement. Our approach overcomes the limitations of slow optimization in distillation-based methods and instability in inpainting-based methods. Experiments on meshes from various sources demonstrate that our method surpasses the baseline methods quantitatively and qualitatively.

* 12 pages, 10 figures

Via

Access Paper or Ask Questions