Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tae Hyun Kim

LAN: Learning to Adapt Noise for Image Denoising

Dec 14, 2024

Changjin Kim, Tae Hyun Kim, Sungyong Baik

Abstract:Removing noise from images, a.k.a image denoising, can be a very challenging task since the type and amount of noise can greatly vary for each image due to many factors including a camera model and capturing environments. While there have been striking improvements in image denoising with the emergence of advanced deep learning architectures and real-world datasets, recent denoising networks struggle to maintain performance on images with noise that has not been seen during training. One typical approach to address the challenge would be to adapt a denoising network to new noise distribution. Instead, in this work, we shift our focus to adapting the input noise itself, rather than adapting a network. Thus, we keep a pretrained network frozen, and adapt an input noise to capture the fine-grained deviations. As such, we propose a new denoising algorithm, dubbed Learning-to-Adapt-Noise (LAN), where a learnable noise offset is directly added to a given noisy image to bring a given input noise closer towards the noise distribution a denoising network is trained to handle. Consequently, the proposed framework exhibits performance improvement on images with unseen noise, displaying the potential of the proposed research direction. The code is available at https://github.com/chjinny/LAN

* CVPR2024

Via

Access Paper or Ask Questions

Deep Variational Bayesian Modeling of Haze Degradation Process

Dec 04, 2024

Eun Woo Im, Junsung Shin, Sungyong Baik, Tae Hyun Kim

Figure 1 for Deep Variational Bayesian Modeling of Haze Degradation Process

Figure 2 for Deep Variational Bayesian Modeling of Haze Degradation Process

Figure 3 for Deep Variational Bayesian Modeling of Haze Degradation Process

Figure 4 for Deep Variational Bayesian Modeling of Haze Degradation Process

Abstract:Relying on the representation power of neural networks, most recent works have often neglected several factors involved in haze degradation, such as transmission (the amount of light reaching an observer from a scene over distance) and atmospheric light. These factors are generally unknown, making dehazing problems ill-posed and creating inherent uncertainties. To account for such uncertainties and factors involved in haze degradation, we introduce a variational Bayesian framework for single image dehazing. We propose to take not only a clean image and but also transmission map as latent variables, the posterior distributions of which are parameterized by corresponding neural networks: dehazing and transmission networks, respectively. Based on a physical model for haze degradation, our variational Bayesian framework leads to a new objective function that encourages the cooperation between them, facilitating the joint training of and thereby boosting the performance of each other. In our framework, a dehazing network can estimate a clean image independently of a transmission map estimation during inference, introducing no overhead. Furthermore, our model-agnostic framework can be seamlessly incorporated with other existing dehazing networks, greatly enhancing the performance consistently across datasets and models.

* In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management 2023 Oct 21 (pp. 895-904)
* Published in CIKM 2023, 10 pages, 9 figures

Via

Access Paper or Ask Questions

Harnessing Meta-Learning for Improving Full-Frame Video Stabilization

Mar 06, 2024

Muhammad Kashif Ali, Eun Woo Im, Dongjin Kim, Tae Hyun Kim

Abstract:Video stabilization is a longstanding computer vision problem, particularly pixel-level synthesis solutions for video stabilization which synthesize full frames add to the complexity of this task. These techniques aim to stabilize videos by synthesizing full frames while enhancing the stability of the considered video. This intensifies the complexity of the task due to the distinct mix of unique motion profiles and visual content present in each video sequence, making robust generalization with fixed parameters difficult. In our study, we introduce a novel approach to enhance the performance of pixel-level synthesis solutions for video stabilization by adapting these models to individual input video sequences. The proposed adaptation exploits low-level visual cues accessible during test-time to improve both the stability and quality of resulting videos. We highlight the efficacy of our methodology of "test-time adaptation" through simple fine-tuning of one of these models, followed by significant stability gain via the integration of meta-learning techniques. Notably, significant improvement is achieved with only a single adaptation step. The versatility of the proposed algorithm is demonstrated by consistently improving the performance of various pixel-level synthesis models for video stabilization in real-world scenarios.

Via

Access Paper or Ask Questions

Efficient Model Agnostic Approach for Implicit Neural Representation Based Arbitrary-Scale Image Super-Resolution

Nov 20, 2023

Young Jae Oh, Jihun Kim, Tae Hyun Kim

Abstract:Single image super-resolution (SISR) has experienced significant advancements, primarily driven by deep convolutional networks. Traditional networks, however, are limited to upscaling images to a fixed scale, leading to the utilization of implicit neural functions for generating arbitrarily scaled images. Nevertheless, these methodologies have imposed substantial computational demands as they involve querying every target pixel to a single resource-intensive decoder. In this paper, we introduce a novel and efficient framework, the Mixture of Experts Implicit Super-Resolution (MoEISR), which enables super-resolution at arbitrary scales with significantly increased computational efficiency without sacrificing reconstruction quality. MoEISR dynamically allocates the most suitable decoding expert to each pixel using a lightweight mapper module, allowing experts with varying capacities to reconstruct pixels across regions with diverse complexities. Our experiments demonstrate that MoEISR successfully reduces up to 73% in floating point operations (FLOPs) while delivering comparable or superior peak signal-to-noise ratio (PSNR).

Via

Access Paper or Ask Questions

NoiseTransfer: Image Noise Generation with Contrastive Embeddings

Jan 31, 2023

Seunghwan Lee, Tae Hyun Kim

Abstract:Deep image denoising networks have achieved impressive success with the help of a considerably large number of synthetic train datasets. However, real-world denoising is a still challenging problem due to the dissimilarity between distributions of real and synthetic noisy datasets. Although several real-world noisy datasets have been presented, the number of train datasets (i.e., pairs of clean and real noisy images) is limited, and acquiring more real noise datasets is laborious and expensive. To mitigate this problem, numerous attempts to simulate real noise models using generative models have been studied. Nevertheless, previous works had to train multiple networks to handle multiple different noise distributions. By contrast, we propose a new generative model that can synthesize noisy images with multiple different noise distributions. Specifically, we adopt recent contrastive learning to learn distinguishable latent features of the noise. Moreover, our model can generate new noisy images by transferring the noise characteristics solely from a single reference noisy image. We demonstrate the accuracy and the effectiveness of our noise model for both known and unknown noise removal.

* ACCV 2022 oral

Via

Access Paper or Ask Questions

Learning Task Agnostic Temporal Consistency Correction

Jun 08, 2022

Muhammad Kashif Ali, Dongjin Kim, Tae Hyun Kim

Figure 1 for Learning Task Agnostic Temporal Consistency Correction

Figure 2 for Learning Task Agnostic Temporal Consistency Correction

Figure 3 for Learning Task Agnostic Temporal Consistency Correction

Figure 4 for Learning Task Agnostic Temporal Consistency Correction

Abstract:Due to the scarcity of video processing methodologies, image processing operations are naively extended to the video domain by processing each frame independently. This disregard for the temporal connection in video processing often leads to severe temporal inconsistencies. State-of-the-art techniques that address these inconsistencies rely on the availability of unprocessed videos to siphon consistent video dynamics to restore the temporal consistency of frame-wise processed videos. We propose a novel general framework for this task that learns to infer consistent motion dynamics from inconsistent videos to mitigate the temporal flicker while preserving the perceptual quality for both the temporally neighboring and relatively distant frames. The proposed framework produces state-of-the-art results on two large-scale datasets, DAVIS and videvo.net, processed by numerous image processing tasks in a feed-forward manner. The code and the trained models will be released upon acceptance.

Via

Access Paper or Ask Questions

Rich CNN-Transformer Feature Aggregation Networks for Super-Resolution

Mar 16, 2022

Jinsu Yoo, Taehoon Kim, Sihaeng Lee, Seung Hwan Kim, Honglak Lee, Tae Hyun Kim

Figure 1 for Rich CNN-Transformer Feature Aggregation Networks for Super-Resolution

Figure 2 for Rich CNN-Transformer Feature Aggregation Networks for Super-Resolution

Figure 3 for Rich CNN-Transformer Feature Aggregation Networks for Super-Resolution

Figure 4 for Rich CNN-Transformer Feature Aggregation Networks for Super-Resolution

Abstract:Recent vision transformers along with self-attention have achieved promising results on various computer vision tasks. In particular, a pure transformer-based image restoration architecture surpasses the existing CNN-based methods using multi-task pre-training with a large number of trainable parameters. In this paper, we introduce an effective hybrid architecture for super-resolution (SR) tasks, which leverages local features from CNNs and long-range dependencies captured by transformers to further improve the SR results. Specifically, our architecture comprises of transformer and convolution branches, and we substantially elevate the performance by mutually fusing two branches to complement each representation. Furthermore, we propose a cross-scale token attention module, which allows the transformer to efficiently exploit the informative relationships among tokens across different scales. Our proposed method achieves state-of-the-art SR results on numerous benchmark datasets.

* 19 pages, 11 figures, preprint

Via

Access Paper or Ask Questions

Restore from Restored: Single-image Inpainting

Oct 25, 2021

Eunhye Lee, Jeongmu Kim, Jisu Kim, Tae Hyun Kim

Figure 1 for Restore from Restored: Single-image Inpainting

Figure 2 for Restore from Restored: Single-image Inpainting

Figure 3 for Restore from Restored: Single-image Inpainting

Figure 4 for Restore from Restored: Single-image Inpainting

Abstract:Recent image inpainting methods have shown promising results due to the power of deep learning, which can explore external information available from the large training dataset. However, many state-of-the-art inpainting networks are still limited in exploiting internal information available in the given input image at test time. To mitigate this problem, we present a novel and efficient self-supervised fine-tuning algorithm that can adapt the parameters of fully pre-trained inpainting networks without using ground-truth target images. We update the parameters of the pre-trained state-of-the-art inpainting networks by utilizing existing self-similar patches (i.e., self-exemplars) within the given input image without changing the network architecture and improve the inpainting quality by a large margin. Qualitative and quantitative experimental results demonstrate the superiority of the proposed algorithm, and we achieve state-of-the-art inpainting results on publicly available benchmark datasets.

* arXiv admin note: substantial text overlap with arXiv:2102.08078

Via

Access Paper or Ask Questions

Self-Supervised Adaptation for Video Super-Resolution

Mar 18, 2021

Jinsu Yoo, Tae Hyun Kim

Figure 1 for Self-Supervised Adaptation for Video Super-Resolution

Figure 2 for Self-Supervised Adaptation for Video Super-Resolution

Figure 3 for Self-Supervised Adaptation for Video Super-Resolution

Figure 4 for Self-Supervised Adaptation for Video Super-Resolution

Abstract:Recent single-image super-resolution (SISR) networks, which can adapt their network parameters to specific input images, have shown promising results by exploiting the information available within the input data as well as large external datasets. However, the extension of these self-supervised SISR approaches to video handling has yet to be studied. Thus, we present a new learning algorithm that allows conventional video super-resolution (VSR) networks to adapt their parameters to test video frames without using the ground-truth datasets. By utilizing many self-similar patches across space and time, we improve the performance of fully pre-trained VSR networks and produce temporally consistent video frames. Moreover, we present a test-time knowledge distillation technique that accelerates the adaptation speed with less hardware resources. In our experiments, we demonstrate that our novel learning algorithm can fine-tune state-of-the-art VSR networks and substantially elevate performance on numerous benchmark datasets.

Via

Access Paper or Ask Questions

Image Restoration by Solving IVP

Feb 05, 2021

Seobin Park, Tae Hyun Kim

Figure 1 for Image Restoration by Solving IVP

Figure 2 for Image Restoration by Solving IVP

Figure 3 for Image Restoration by Solving IVP

Figure 4 for Image Restoration by Solving IVP

Abstract:Recent research on image restoration have achieved great success with the aid of deep learning technologies, but, many of them are limited to dealing SR with realistic settings. To alleviate this problem, we introduce a new formulation for image super-resolution to solve arbitrary scale image super-resolution methods. Based on the proposed new SR formulation, we can not only super-resolve images with multiple scales, but also find a new way to analyze the performance of super-resolving process. We demonstrate that the proposed method can generate high-quality images unlike conventional SR methods.

* Revision on the abstract and main text; Remove first figure, table

Via

Access Paper or Ask Questions