Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:RGB$\leftrightarrow$X: Image decomposition and synthesis using material- and lighting-aware diffusion models

May 01, 2024

Zheng Zeng, Valentin Deschaintre, Iliyan Georgiev, Yannick Hold-Geoffroy, Yiwei Hu, Fujun Luan, Ling-Qi Yan, Miloš Hašan

$Figure 1 for RGB$\leftrightarrow$X: Image decomposition and synthesis using material- and lighting-aware diffusion models$

$Figure 2 for RGB$\leftrightarrow$X: Image decomposition and synthesis using material- and lighting-aware diffusion models$

$Figure 3 for RGB$\leftrightarrow$X: Image decomposition and synthesis using material- and lighting-aware diffusion models$

$Figure 4 for RGB$\leftrightarrow$X: Image decomposition and synthesis using material- and lighting-aware diffusion models$

Share this with someone who'll enjoy it:

Abstract:The three areas of realistic forward rendering, per-pixel inverse rendering, and generative image synthesis may seem like separate and unrelated sub-fields of graphics and vision. However, recent work has demonstrated improved estimation of per-pixel intrinsic channels (albedo, roughness, metallicity) based on a diffusion architecture; we call this the RGB$\rightarrow$X problem. We further show that the reverse problem of synthesizing realistic images given intrinsic channels, X$\rightarrow$RGB, can also be addressed in a diffusion framework. Focusing on the image domain of interior scenes, we introduce an improved diffusion model for RGB$\rightarrow$X, which also estimates lighting, as well as the first diffusion X$\rightarrow$RGB model capable of synthesizing realistic images from (full or partial) intrinsic channels. Our X$\rightarrow$RGB model explores a middle ground between traditional rendering and generative models: we can specify only certain appearance properties that should be followed, and give freedom to the model to hallucinate a plausible version of the rest. This flexibility makes it possible to use a mix of heterogeneous training datasets, which differ in the available channels. We use multiple existing datasets and extend them with our own synthetic and real data, resulting in a model capable of extracting scene properties better than previous work and of generating highly realistic images of interior scenes.

* SIGGRAPH Conference Papers '24, July 27-August 1, 2024, Denver, CO, USA

View paper on

Share this with someone who'll enjoy it:

Title:RGB$\leftrightarrow$X: Image decomposition and synthesis using material- and lighting-aware diffusion models

Paper and Code