Yi-Chen Lo

GCC: Generative Color Constancy via Diffusing a Color Checker

Feb 24, 2025

Chen-Wei Chang, Cheng-De Fan, Chia-Che Chang, Yi-Chen Lo, Yu-Chee Tseng, Jiun-Long Huang, Yu-Lun Liu

Abstract: Color constancy methods often struggle to generalize across different camera sensors due to varying spectral sensitivities. We present GCC, which leverages diffusion models to inpaint color checkers into images for illumination estimation. Our key innovations include (1) a single-step deterministic inference approach that inpaints color checkers reflecting scene illumination, (2) a Laplacian decomposition technique that preserves checker structure while allowing illumination-dependent color adaptation, and (3) a mask-based data augmentation strategy for handling imprecise color checker annotations. GCC demonstrates superior robustness in cross-camera scenarios, achieving state-of-the-art worst-25% error rates of 5.15° and 4.32° in bi-directional evaluations. These results highlight our method's stability and generalization capability across different camera characteristics without requiring sensor-specific training, making it a versatile solution for real-world applications.
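The errors quoted in degrees are angular errors between an estimated and a ground-truth illuminant, and "worst-25%" conventionally means the mean over the largest quarter of per-image errors. The sketch below illustrates that convention only; the function names `angular_error_deg` and `worst_fraction_mean` are hypothetical and are not taken from the GCC code.

```python
import numpy as np

def angular_error_deg(est, gt):
    """Angle in degrees between estimated and ground-truth illuminant RGB vectors."""
    est = np.asarray(est, dtype=float)
    gt = np.asarray(gt, dtype=float)
    cos = np.sum(est * gt, axis=-1) / (
        np.linalg.norm(est, axis=-1) * np.linalg.norm(gt, axis=-1)
    )
    # Clip to guard against values slightly outside [-1, 1] from rounding.
    return np.degrees(np.arccos(np.clip(cos, -1.0, 1.0)))

def worst_fraction_mean(errors, fraction=0.25):
    """Mean of the largest `fraction` of errors (assumed 'worst-25%' convention)."""
    errors = np.sort(np.asarray(errors, dtype=float))[::-1]  # descending order
    k = max(1, int(np.ceil(fraction * errors.size)))
    return float(errors[:k].mean())
```

Because the angle ignores vector magnitude, an estimate that differs from the ground truth only by a brightness scale has zero error.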

* Project page: https://chenwei891213.github.io/GCC/

Boosting Flow-based Generative Super-Resolution Models via Learned Prior

Mar 30, 2024

Li-Yuan Tsao, Yi-Chen Lo, Chia-Che Chang, Hao-Wei Chen, Roy Tseng, Chien Feng, Chun-Yi Lee

Abstract: Flow-based super-resolution (SR) models have demonstrated astonishing capabilities in generating high-quality images. However, these methods encounter several challenges during image generation, such as grid artifacts, exploding inverses, and suboptimal results due to a fixed sampling temperature. To overcome these issues, this work introduces a conditional learned prior to the inference phase of a flow-based SR model. This prior is a latent code predicted by our proposed latent module conditioned on the low-resolution image, which is then transformed by the flow model into an SR image. Our framework is designed to seamlessly integrate with any contemporary flow-based SR model without modifying its architecture or pre-trained weights. We evaluate the effectiveness of our proposed framework through extensive experiments and ablation analyses. The proposed framework successfully addresses all the inherent issues in flow-based SR models and enhances their performance in various SR scenarios. Our code is available at: https://github.com/liyuantsao/FlowSR-LP

* Accepted to CVPR 2024

Local Implicit Normalizing Flow for Arbitrary-Scale Image Super-Resolution

Mar 09, 2023

Jie-En Yao, Li-Yuan Tsao, Yi-Chen Lo, Roy Tseng, Chia-Che Chang, Chun-Yi Lee

Abstract: Flow-based methods have demonstrated promising results in addressing the ill-posed nature of super-resolution (SR) by learning the distribution of high-resolution (HR) images with a normalizing flow. However, these methods can only perform SR at a predefined fixed scale, limiting their potential in real-world applications. Meanwhile, arbitrary-scale SR has gained more attention and achieved great progress. Nonetheless, previous arbitrary-scale SR methods ignore the ill-posed problem and train the model with a per-pixel L1 loss, leading to blurry SR outputs. In this work, we propose "Local Implicit Normalizing Flow" (LINF) as a unified solution to the above problems. LINF models the distribution of texture details under different scaling factors with a normalizing flow. Thus, LINF can generate photo-realistic HR images with rich texture details at arbitrary scale factors. We evaluate LINF with extensive experiments and show that LINF achieves state-of-the-art perceptual quality compared with prior arbitrary-scale SR methods.
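As background for "learning the distribution ... with a normalizing flow", the standard change-of-variables identity that such models optimize can be written as below. The symbols are generic notation assumed here (an invertible map $f$ sending a data point $x$ to a latent $z$ with a simple base density $p_Z$); this is not LINF's specific conditional formulation.

```latex
\log p_X(x) = \log p_Z\bigl(f(x)\bigr)
            + \log \left| \det \frac{\partial f(x)}{\partial x} \right|,
\qquad z = f(x)
```

Sampling runs the map in reverse: draw $z$ from $p_Z$ and compute $x = f^{-1}(z)$, which is why a single trained flow can produce many plausible outputs for one input.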

* Accepted to CVPR 2023

ELDA: Using Edges to Have an Edge on Semantic Segmentation Based UDA

Nov 16, 2022

Ting-Hsuan Liao, Huang-Ru Liao, Shan-Ya Yang, Jie-En Yao, Li-Yuan Tsao, Hsu-Shen Liu, Bo-Wun Cheng, Chen-Hao Chao, Chia-Che Chang, Yi-Chen Lo (+1 more)

Abstract: Many unsupervised domain adaptation (UDA) methods have been proposed to bridge the domain gap by utilizing domain-invariant information. Most approaches have chosen depth as such information and achieved remarkable success. Despite their effectiveness, using depth as domain-invariant information in UDA tasks may lead to multiple issues, such as excessively high extraction costs and difficulties in achieving a reliable prediction quality. As a result, we introduce Edge Learning based Domain Adaptation (ELDA), a framework which incorporates edge information into its training process to serve as a type of domain-invariant information. In our experiments, we quantitatively and qualitatively demonstrate that incorporating edge information is beneficial and enables ELDA to outperform contemporary state-of-the-art methods on two commonly adopted benchmarks for semantic segmentation based UDA tasks. In addition, we show that ELDA is able to better separate the feature distributions of different classes. We further provide an ablation analysis to justify our design decisions.

* Accepted by BMVC 2022. Ting-Hsuan Liao and Huang-Ru Liao contributed equally to this work

Denoising Likelihood Score Matching for Conditional Score-based Data Generation

Mar 27, 2022

Chen-Hao Chao, Wei-Fang Sun, Bo-Wun Cheng, Yi-Chen Lo, Chia-Che Chang, Yu-Lun Liu, Yu-Lin Chang, Chia-Ping Chen, Chun-Yi Lee

Abstract: Many existing conditional score-based data generation methods utilize Bayes' theorem to decompose the gradients of a log posterior density into a mixture of scores. These methods facilitate the training procedure of conditional score models, as a mixture of scores can be separately estimated using a score model and a classifier. However, our analysis indicates that the training objectives for the classifier in these methods may lead to a serious score mismatch issue, in which the estimated scores deviate from the true ones. This issue causes the samples to be misled by the deviated scores during the diffusion process, resulting in degraded sampling quality. To resolve it, we formulate a novel training objective, called the Denoising Likelihood Score Matching (DLSM) loss, for the classifier to match the gradients of the true log likelihood density. Our experimental evidence shows that the proposed method noticeably outperforms previous methods on both the CIFAR-10 and CIFAR-100 benchmarks in terms of several key evaluation metrics. We thus conclude that, by adopting DLSM, the conditional scores can be accurately modeled, and the effect of the score mismatch issue is alleviated.
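The Bayes decomposition the abstract refers to can be written out explicitly. The notation is assumed here for illustration ($x$ for a data sample, $y$ for the condition such as a class label) and may differ from the paper's:

```latex
\nabla_x \log p(x \mid y)
  = \nabla_x \log \frac{p(y \mid x)\, p(x)}{p(y)}
  = \underbrace{\nabla_x \log p(x)}_{\text{score model}}
  + \underbrace{\nabla_x \log p(y \mid x)}_{\text{classifier}}
```

The $\log p(y)$ term drops out because it does not depend on $x$. The second term is the likelihood score that a classifier must supply, so any error in the classifier's input gradients carries directly into the posterior score.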

* ICLR 2022

CLCC: Contrastive Learning for Color Constancy

Jun 09, 2021

Yi-Chen Lo, Chia-Che Chang, Hsuan-Chao Chiu, Yu-Hao Huang, Chia-Ping Chen, Yu-Lin Chang, Kevin Jou

Abstract: In this paper, we present CLCC, a novel contrastive learning framework for color constancy. Contrastive learning has been applied for learning high-quality visual representations for image classification. One key aspect to yield useful representations for image classification is to design illuminant-invariant augmentations. However, the illuminant-invariant assumption conflicts with the nature of the color constancy task, which aims to estimate the illuminant given a raw image. Therefore, we construct effective contrastive pairs for learning better illuminant-dependent features via a novel raw-domain color augmentation. On the NUS-8 dataset, our method provides a 17.5% relative improvement over a strong baseline, reaching state-of-the-art performance without increasing model complexity. Furthermore, our method achieves competitive performance on the Gehler dataset with 3× fewer parameters compared to top-ranking deep learning methods. More importantly, we show that our model is more robust to different scenes under close proximity of illuminants, reducing worst-case error by 28.7% in data-sparse regions.

* Accepted at CVPR 2021. Our code is available at https://github.com/howardyclo/clcc-cvpr21

One-Shot Object Detection with Co-Attention and Co-Excitation

Nov 28, 2019

Ting-I Hsieh, Yi-Chen Lo, Hwann-Tzong Chen, Tyng-Luh Liu

Abstract: This paper aims to tackle the challenging problem of one-shot object detection. Given a query image patch whose class label is not included in the training data, the goal of the task is to detect all instances of the same class in a target image. To this end, we develop a novel co-attention and co-excitation (CoAE) framework that makes contributions in three key technical aspects. First, we propose to use the non-local operation to explore the co-attention embodied in each query-target pair and yield region proposals accounting for the one-shot situation. Second, we formulate a squeeze-and-co-excitation scheme that can adaptively emphasize correlated feature channels to help uncover relevant proposals and eventually the target objects. Third, we design a margin-based ranking loss for implicitly learning a metric to predict the similarity of a region proposal to the underlying query, regardless of whether its class label was seen or unseen in training. The resulting model is therefore a two-stage detector that yields a strong baseline on both VOC and MS-COCO under the one-shot setting of detecting objects from both seen and never-seen classes. Code is available at https://github.com/timy90022/One-Shot-Object-Detection.

* NeurIPS 2019
