Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:TC-PDM: Temporally Consistent Patch Diffusion Models for Infrared-to-Visible Video Translation

Aug 26, 2024

Anh-Dzung Doan, Vu Minh Hieu Phan, Surabhi Gupta, Markus Wagner, Tat-Jun Chin, Ian Reid

Figure 1 for TC-PDM: Temporally Consistent Patch Diffusion Models for Infrared-to-Visible Video Translation

Figure 2 for TC-PDM: Temporally Consistent Patch Diffusion Models for Infrared-to-Visible Video Translation

Figure 3 for TC-PDM: Temporally Consistent Patch Diffusion Models for Infrared-to-Visible Video Translation

Figure 4 for TC-PDM: Temporally Consistent Patch Diffusion Models for Infrared-to-Visible Video Translation

Share this with someone who'll enjoy it:

Abstract:Infrared imaging offers resilience against changing lighting conditions by capturing object temperatures. Yet, in few scenarios, its lack of visual details compared to daytime visible images, poses a significant challenge for human and machine interpretation. This paper proposes a novel diffusion method, dubbed Temporally Consistent Patch Diffusion Models (TC-DPM), for infrared-to-visible video translation. Our method, extending the Patch Diffusion Model, consists of two key components. Firstly, we propose a semantic-guided denoising, leveraging the strong representations of foundational models. As such, our method faithfully preserves the semantic structure of generated visible images. Secondly, we propose a novel temporal blending module to guide the denoising trajectory, ensuring the temporal consistency between consecutive frames. Experiment shows that TC-PDM outperforms state-of-the-art methods by 35.3% in FVD for infrared-to-visible video translation and by 6.1% in AP50 for day-to-night object detection. Our code is publicly available at https://github.com/dzungdoan6/tc-pdm

* Technical report

View paper on

Share this with someone who'll enjoy it:

Title:TC-PDM: Temporally Consistent Patch Diffusion Models for Infrared-to-Visible Video Translation

Paper and Code