Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tianheng Qiu

NTIRE 2025 Challenge on Efficient Burst HDR and Restoration: Datasets, Methods, and Results

May 17, 2025

Sangmin Lee, Eunpil Park, Angel Canelo, Hyunhee Park, Youngjo Kim, Hyung-Ju Chun, Xin Jin, Chongyi Li, Chun-Le Guo, Radu Timofte(+26 more)

Figure 1 for NTIRE 2025 Challenge on Efficient Burst HDR and Restoration: Datasets, Methods, and Results

Figure 2 for NTIRE 2025 Challenge on Efficient Burst HDR and Restoration: Datasets, Methods, and Results

Figure 3 for NTIRE 2025 Challenge on Efficient Burst HDR and Restoration: Datasets, Methods, and Results

Figure 4 for NTIRE 2025 Challenge on Efficient Burst HDR and Restoration: Datasets, Methods, and Results

Abstract:This paper reviews the NTIRE 2025 Efficient Burst HDR and Restoration Challenge, which aims to advance efficient multi-frame high dynamic range (HDR) and restoration techniques. The challenge is based on a novel RAW multi-frame fusion dataset, comprising nine noisy and misaligned RAW frames with various exposure levels per scene. Participants were tasked with developing solutions capable of effectively fusing these frames while adhering to strict efficiency constraints: fewer than 30 million model parameters and a computational budget under 4.0 trillion FLOPs. A total of 217 participants registered, with six teams finally submitting valid solutions. The top-performing approach achieved a PSNR of 43.22 dB, showcasing the potential of novel methods in this domain. This paper provides a comprehensive overview of the challenge, compares the proposed solutions, and serves as a valuable reference for researchers and practitioners in efficient burst HDR and restoration.

Via

Access Paper or Ask Questions

CLDA-YOLO: Visual Contrastive Learning Based Domain Adaptive YOLO Detector

Dec 16, 2024

Tianheng Qiu, Ka Lung Law, Guanghua Pan, Jufei Wang, Xin Gao, Xuan Huang, Hu Wei

Figure 1 for CLDA-YOLO: Visual Contrastive Learning Based Domain Adaptive YOLO Detector

Figure 2 for CLDA-YOLO: Visual Contrastive Learning Based Domain Adaptive YOLO Detector

Figure 3 for CLDA-YOLO: Visual Contrastive Learning Based Domain Adaptive YOLO Detector

Figure 4 for CLDA-YOLO: Visual Contrastive Learning Based Domain Adaptive YOLO Detector

Abstract:Unsupervised domain adaptive (UDA) algorithms can markedly enhance the performance of object detectors under conditions of domain shifts, thereby reducing the necessity for extensive labeling and retraining. Current domain adaptive object detection algorithms primarily cater to two-stage detectors, which tend to offer minimal improvements when directly applied to single-stage detectors such as YOLO. Intending to benefit the YOLO detector from UDA, we build a comprehensive domain adaptive architecture using a teacher-student cooperative system for the YOLO detector. In this process, we propose uncertainty learning to cope with pseudo-labeling generated by the teacher model with extreme uncertainty and leverage dynamic data augmentation to asymptotically adapt the teacher-student system to the environment. To address the inability of single-stage object detectors to align at multiple stages, we utilize a unified visual contrastive learning paradigm that aligns instance at backbone and head respectively, which steadily improves the robustness of the detectors in cross-domain tasks. In summary, we present an unsupervised domain adaptive YOLO detector based on visual contrastive learning (CLDA-YOLO), which achieves highly competitive results across multiple domain adaptive datasets without any reduction in inference speed.

Via

Access Paper or Ask Questions

Efficient Multi-scale Network with Learnable Discrete Wavelet Transform for Blind Motion Deblurring

Dec 29, 2023

Xin Gao, Tianheng Qiu, Xinyu Zhang, Hanlin Bai, Kang Liu, Xuan Huang, Hu Wei, Guoying Zhang, Huaping Liu

Figure 1 for Efficient Multi-scale Network with Learnable Discrete Wavelet Transform for Blind Motion Deblurring

Figure 2 for Efficient Multi-scale Network with Learnable Discrete Wavelet Transform for Blind Motion Deblurring

Figure 3 for Efficient Multi-scale Network with Learnable Discrete Wavelet Transform for Blind Motion Deblurring

Figure 4 for Efficient Multi-scale Network with Learnable Discrete Wavelet Transform for Blind Motion Deblurring

Abstract:Coarse-to-fine schemes are widely used in traditional single-image motion deblur; however, in the context of deep learning, existing multi-scale algorithms not only require the use of complex modules for feature fusion of low-scale RGB images and deep semantics, but also manually generate low-resolution pairs of images that do not have sufficient confidence. In this work, we propose a multi-scale network based on single-input and multiple-outputs(SIMO) for motion deblurring. This simplifies the complexity of algorithms based on a coarse-to-fine scheme. To alleviate restoration defects impacting detail information brought about by using a multi-scale architecture, we combine the characteristics of real-world blurring trajectories with a learnable wavelet transform module to focus on the directional continuity and frequency features of the step-by-step transitions between blurred images to sharp images. In conclusion, we propose a multi-scale network with a learnable discrete wavelet transform (MLWNet), which exhibits state-of-the-art performance on multiple real-world deblurred datasets, in terms of both subjective and objective quality as well as computational efficiency.

Via

Access Paper or Ask Questions