Virtual engines can render dense depth maps for a wide variety of synthetic scenes, making them invaluable for training depth estimation models. However, synthetic colors often differ markedly from real-world colors, which hinders depth estimation in real-world scenes, particularly in the complex and uncertain environments encountered in unsupervised monocular depth estimation. To address this issue, we propose Back2Color, a framework that predicts realistic colors from depth using a model trained on real-world data, thereby transforming synthetic colors into real-world counterparts. In addition, by employing the Syn-Real CutMix method to jointly train on real-world unsupervised and synthetic supervised depth samples, we improve monocular depth estimation in real-world scenes. To address the impact of non-rigid motion on depth estimation more comprehensively, we propose an auto-learning uncertainty temporal-spatial fusion method (Auto-UTSF), which combines the strengths of unsupervised learning in the temporal and spatial dimensions. We also design a depth estimation network, VADepth, based on the Vision Attention Network. Back2Color achieves state-of-the-art performance, improving on standard metrics and producing fine-grained detail in its predictions, notably on challenging datasets such as Cityscapes for unsupervised depth estimation.
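
As a rough illustration of the Syn-Real CutMix idea mentioned above, the sketch below mixes a synthetic (supervised) batch into a real (unsupervised) batch following the standard CutMix recipe of Yun et al.: a rectangle whose area is governed by a Beta-sampled ratio is pasted from one image into the other. The function name, the Beta sampling, and the mask convention are illustrative assumptions, not the paper's exact implementation.

```python
import torch

def syn_real_cutmix(real_img, syn_img, syn_depth, beta=1.0):
    """Paste a random rectangle from a synthetic image (with its dense
    depth) into a real image. Returns the mixed image, the synthetic
    depth, and a mask marking synthetic pixels (1) vs. real pixels (0)."""
    b, c, h, w = real_img.shape
    lam = torch.distributions.Beta(beta, beta).sample().item()
    # Rectangle covering a (1 - lam) fraction of the image area, as in CutMix.
    cut_h = int(h * (1.0 - lam) ** 0.5)
    cut_w = int(w * (1.0 - lam) ** 0.5)
    cy = torch.randint(0, h, (1,)).item()
    cx = torch.randint(0, w, (1,)).item()
    y1, y2 = max(cy - cut_h // 2, 0), min(cy + cut_h // 2, h)
    x1, x2 = max(cx - cut_w // 2, 0), min(cx + cut_w // 2, w)

    mixed = real_img.clone()
    mixed[:, :, y1:y2, x1:x2] = syn_img[:, :, y1:y2, x1:x2]
    mask = torch.zeros(b, 1, h, w, device=real_img.device)
    mask[:, :, y1:y2, x1:x2] = 1.0
    return mixed, syn_depth, mask
```

Under this convention, joint training would apply the supervised depth loss only where the mask is 1 (synthetic pixels with ground-truth depth) and the unsupervised photometric loss where it is 0 (real pixels), so each pixel is supervised by exactly one of the two signals.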