Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Consistent Depth Prediction under Various Illuminations using Dilated Cross Attention

Dec 15, 2021

Zitian Zhang, Chuhua Xian

Figure 1 for Consistent Depth Prediction under Various Illuminations using Dilated Cross Attention

Figure 2 for Consistent Depth Prediction under Various Illuminations using Dilated Cross Attention

Figure 3 for Consistent Depth Prediction under Various Illuminations using Dilated Cross Attention

Figure 4 for Consistent Depth Prediction under Various Illuminations using Dilated Cross Attention

Share this with someone who'll enjoy it:

Abstract:In this paper, we aim to solve the problem of consistent depth prediction in complex scenes under various illumination conditions. The existing indoor datasets based on RGB-D sensors or virtual rendering have two critical limitations - sparse depth maps (NYU Depth V2) and non-realistic illumination (SUN CG, SceneNet RGB-D). We propose to use internet 3D indoor scenes and manually tune their illuminations to render photo-realistic RGB photos and their corresponding depth and BRDF maps, obtaining a new indoor depth dataset called Vari dataset. We propose a simple convolutional block named DCA by applying depthwise separable dilated convolution on encoded features to process global information and reduce parameters. We perform cross attention on these dilated features to retain the consistency of depth prediction under different illuminations. Our method is evaluated by comparing it with current state-of-the-art methods on Vari dataset and a significant improvement is observed in our experiments. We also conduct the ablation study, finetune our model on NYU Depth V2 and also evaluate on real-world data to further validate the effectiveness of our DCA block. The code, pre-trained weights and Vari dataset are open-sourced.

* 14 pages

View paper on

Share this with someone who'll enjoy it:

Title:Consistent Depth Prediction under Various Illuminations using Dilated Cross Attention

Paper and Code