Sungik Choi

HFI: A unified framework for training-free detection and implicit watermarking of latent diffusion model generated images

Dec 30, 2024

Sungik Choi, Sungwoo Park, Jaehoon Lee, Seunghyun Kim, Stanley Jungkyu Choi, Moontae Lee

Abstract:Dramatic advances in the quality of latent diffusion models (LDMs) have also led to the malicious use of AI-generated images. While current AI-generated image detection methods assume the availability of real/AI-generated images for training, this is practically limited given the vast expressibility of LDMs. This motivates the training-free detection setup where no related data are available in advance. The existing LDM-generated image detection method assumes that images generated by an LDM are easier to reconstruct using an autoencoder than real images. However, we observe that this reconstruction distance is overfitted to background information, leading the current method to underperform in detecting images with simple backgrounds. To address this, we propose a novel method called HFI. Specifically, by viewing the autoencoder of the LDM as a downsampling-upsampling kernel, HFI measures the extent of aliasing, a distortion of high-frequency information that appears in the reconstructed image. HFI is training-free, efficient, and consistently outperforms other training-free methods in detecting challenging images generated by various generative models. We also show that HFI can successfully detect the images generated from the specified LDM as a means of implicit watermarking. HFI outperforms the best baseline method while achieving magnitudes of

Diffusion based Semantic Outlier Generation via Nuisance Awareness for Out-of-Distribution Detection

Aug 27, 2024

Suhee Yoon, Sanghyu Yoon, Hankook Lee, Ye Seul Sim, Sungik Choi, Kyungeun Lee, Hye-Seung Cho, Woohyung Lim

Abstract:Out-of-distribution (OOD) detection, which determines whether a given sample is part of the in-distribution (ID), has recently shown promising results through training with synthetic OOD datasets. Nonetheless, existing methods often produce outliers that are considerably distant from the ID, showing limited efficacy for capturing subtle distinctions between ID and OOD. To address these issues, we propose a novel framework, Semantic Outlier generation via Nuisance Awareness (SONA), which notably produces challenging outliers by directly leveraging pixel-space ID samples through diffusion models. Our approach incorporates SONA guidance, providing separate control over semantic and nuisance regions of ID samples. Thereby, the generated outliers achieve two crucial properties: (i) they present explicit semantic-discrepant information, while (ii) maintaining various levels of nuisance resemblance with ID. Furthermore, the improved OOD detector training with SONA outliers facilitates learning with a focus on semantic distinctions. Extensive experiments demonstrate the effectiveness of our framework, achieving an impressive AUROC of 88% on near-OOD datasets, which surpasses the performance of baseline methods by a significant margin of approximately 6%.

Partial-Multivariate Model for Forecasting

Aug 19, 2024

Jaehoon Lee, Hankook Lee, Sungik Choi, Sungjun Cho, Moontae Lee

Abstract:When solving forecasting problems including multiple time-series features, existing approaches often fall into two extreme categories, depending on whether to utilize inter-feature information: univariate and complete-multivariate models. Unlike univariate cases which ignore the information, complete-multivariate models compute relationships among a complete set of features. However, despite the potential advantage of leveraging the additional information, complete-multivariate models sometimes underperform univariate ones. Therefore, our research aims to explore a middle ground between these two by introducing what we term Partial-Multivariate models where a neural network captures only partial relationships, that is, dependencies within subsets of all features. To this end, we propose PMformer, a Transformer-based partial-multivariate model, with its training algorithm. We demonstrate that PMformer outperforms various univariate and complete-multivariate models, providing a theoretical rationale and empirical analysis for its superiority. Additionally, by proposing an inference technique for PMformer, the forecasting accuracy is further enhanced. Finally, we highlight other advantages of PMformer: efficiency and robustness under missing features.

* 25 pages

Learning Equi-angular Representations for Online Continual Learning

Apr 02, 2024

Minhyuk Seo, Hyunseo Koh, Wonje Jeung, Minjae Lee, San Kim, Hankook Lee, Sungjun Cho, Sungik Choi, Hyunwoo Kim, Jonghyun Choi

Abstract:Online continual learning suffers from an underfitted solution due to insufficient training for prompt model update (e.g., single-epoch training). To address the challenge, we propose an efficient online continual learning method using the neural collapse phenomenon. In particular, we induce neural collapse to form a simplex equiangular tight frame (ETF) structure in the representation space so that the continuously learned model with a single epoch can better fit to the streamed data by proposing preparatory data training and residual correction in the representation space. With an extensive set of empirical validations using CIFAR-10/100, TinyImageNet, ImageNet-200, and ImageNet-1K, we show that our proposed method outperforms state-of-the-art methods by a noticeable margin in various online continual learning scenarios such as disjoint and Gaussian scheduled continuous (i.e., boundary-free) data setups.
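
The simplex equiangular tight frame mentioned in the abstract has a standard closed form, which the following minimal sketch constructs. It is only an illustration of the geometric target, not the paper's training method; the function names `simplex_etf` and `cosine` are hypothetical, and the construction used is the textbook one, sqrt(K/(K-1)) * (I - (1/K) 11^T), with one row per class.

```python
import math


def simplex_etf(k: int) -> list[list[float]]:
    """Return k unit vectors in R^k forming a simplex ETF (textbook construction).

    Row i is sqrt(k / (k - 1)) * (e_i - (1/k) * ones).
    """
    if k < 2:
        raise ValueError("a simplex ETF needs at least 2 classes")
    scale = math.sqrt(k / (k - 1))
    return [
        [scale * ((1.0 if i == j else 0.0) - 1.0 / k) for j in range(k)]
        for i in range(k)
    ]


def cosine(u: list[float], v: list[float]) -> float:
    """Cosine similarity between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


if __name__ == "__main__":
    frame = simplex_etf(4)
    # Every pair of distinct class vectors has the same cosine, -1 / (k - 1).
    print(cosine(frame[0], frame[1]), cosine(frame[1], frame[3]))
```

Every vector has unit norm and every pair of distinct vectors has cosine similarity -1/(K-1), which is the largest equal pairwise separation K vectors can have.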

* CVPR 2024

Projection Regret: Reducing Background Bias for Novelty Detection via Diffusion Models

Dec 05, 2023

Sungik Choi, Hankook Lee, Honglak Lee, Moontae Lee

Abstract:Novelty detection is a fundamental task of machine learning which aims to detect abnormal (i.e., out-of-distribution (OOD)) samples. Since diffusion models have recently emerged as the de facto standard generative framework with surprising generation results, novelty detection via diffusion models has also gained much attention. Recent methods have mainly utilized the reconstruction property of in-distribution samples. However, they often suffer from detecting OOD samples that share similar background information to the in-distribution data. Based on our observation that diffusion models can project any sample to an in-distribution sample with similar background information, we propose Projection Regret (PR), an efficient novelty detection method that mitigates the bias of non-semantic information. To be specific, PR computes the perceptual distance between the test image and its diffusion-based projection to detect abnormality. Since the perceptual distance often fails to capture semantic changes when the background information is dominant, we cancel out the background bias by comparing it against recursive projections. Extensive experiments demonstrate that PR outperforms the prior art of generative-model-based novelty detection methods by a significant margin.

* NeurIPS 2023

Observation-Guided Diffusion Probabilistic Models

Oct 06, 2023

Junoh Kang, Jinyoung Choi, Sungik Choi, Bohyung Han

Abstract:We propose a novel diffusion model called observation-guided diffusion probabilistic model (OGDM), which effectively addresses the trade-off between quality control and fast sampling. Our approach reestablishes the training objective by integrating the guidance of the observation process with the Markov chain in a principled way. This is achieved by introducing an additional loss term derived from the observation based on the conditional discriminator on noise level, which employs a Bernoulli distribution indicating whether its input lies on the (noisy) real manifold or not. This strategy allows us to optimize the more accurate negative log-likelihood induced in the inference stage, especially when the number of function evaluations is limited. The proposed training method is also advantageous even when incorporated only into the fine-tuning process, and it is compatible with various fast inference strategies since our method yields better denoising networks using exactly the same inference procedure without incurring extra computational cost. We demonstrate the effectiveness of the proposed training algorithm using diverse inference methods on strong diffusion model baselines.

Transferring Pre-trained Multimodal Representations with Cross-modal Similarity Matching

Jan 07, 2023

Byoungjip Kim, Sungik Choi, Dasol Hwang, Moontae Lee, Honglak Lee

Abstract:Despite surprising performance on zero-shot transfer, pre-training a large-scale multimodal model is often prohibitive as it requires a huge amount of data and computing resources. In this paper, we propose a method (BeamCLIP) that can effectively transfer the representations of a large pre-trained multimodal model (CLIP-ViT) into a small target model (e.g., ResNet-18). For unsupervised transfer, we introduce cross-modal similarity matching (CSM) that enables a student model to learn the representations of a teacher model by matching the relative similarity distribution across text prompt embeddings. To better encode the text prompts, we design context-based prompt augmentation (CPA) that can alleviate the lexical ambiguity of input text prompts. Our experiments show that unsupervised representation transfer of a pre-trained vision-language model enables a small ResNet-18 to achieve a better ImageNet-1K top-1 linear probe accuracy (66.2%) than vision-only self-supervised learning (SSL) methods (e.g., SimCLR: 51.8%, SwAV: 63.7%), while closing the gap with supervised learning (69.8%).
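
As a rough illustration of the idea of matching a similarity distribution across text prompt embeddings, the sketch below compares a teacher and a student image embedding against a shared set of text embeddings. Everything here is an assumption made for illustration: the function names, the use of cosine similarity with a softmax temperature, the choice of KL divergence as the matching loss, and the decision to score both models against the same text embeddings. It is not the BeamCLIP implementation.

```python
import math


def cosine(u: list[float], v: list[float]) -> float:
    """Cosine similarity between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v)))


def softmax(scores: list[float], temperature: float) -> list[float]:
    """Numerically stable softmax over temperature-scaled scores."""
    scaled = [s / temperature for s in scores]
    top = max(scaled)
    exps = [math.exp(s - top) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def similarity_distribution(image_emb, text_embs, temperature=0.1):
    """Distribution over text prompts induced by image-to-text cosine similarity."""
    return softmax([cosine(image_emb, t) for t in text_embs], temperature)


def matching_loss(teacher_img, student_img, text_embs, temperature=0.1):
    """KL(teacher || student) between the two similarity distributions (assumed loss)."""
    p = similarity_distribution(teacher_img, text_embs, temperature)
    q = similarity_distribution(student_img, text_embs, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))


if __name__ == "__main__":
    prompts = [[1.0, 0.0], [0.0, 1.0]]  # hypothetical text prompt embeddings
    print(matching_loss([1.0, 0.0], [0.6, 0.8], prompts))
```

The loss is zero when the student ranks the prompts exactly as the teacher does, and grows as the two distributions diverge, so no image labels are needed.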

* 20 pages, 10 figures, NeurIPS 2022

Unsupervised Visual Representation Learning via Mutual Information Regularized Assignment

Nov 04, 2022

Dong Hoon Lee, Sungik Choi, Hyunwoo Kim, Sae-Young Chung

Abstract:This paper proposes Mutual Information Regularized Assignment (MIRA), a pseudo-labeling algorithm for unsupervised representation learning inspired by information maximization. We formulate online pseudo-labeling as an optimization problem to find pseudo-labels that maximize the mutual information between the label and data while being close to a given model probability. We derive a fixed-point iteration method and prove its convergence to the optimal solution. In contrast to baselines, MIRA combined with pseudo-label prediction enables a simple yet effective clustering-based representation learning without incorporating extra training techniques or artificial constraints such as sampling strategy, equipartition constraints, etc. With relatively few training epochs, the representation learned by MIRA achieves state-of-the-art performance on various downstream tasks, including the linear/k-NN evaluation and transfer learning. In particular, with only 400 epochs, our method applied to the ImageNet dataset with the ResNet-50 architecture achieves 75.6% linear evaluation accuracy.

* NeurIPS 2022

Novelty Detection Via Blurring

Jan 07, 2020

Sungik Choi, Sae-Young Chung

Abstract:Conventional out-of-distribution (OOD) detection schemes based on variational autoencoder or Random Network Distillation (RND) have been observed to assign lower uncertainty to OOD data than to the target distribution. In this work, we discover that such conventional novelty detection schemes are also vulnerable to blurred images. Based on this observation, we construct a novel RND-based OOD detector, SVD-RND, that utilizes blurred images during training. Our detector is simple, efficient at test time, and outperforms baseline OOD detectors in various domains. Further results show that SVD-RND learns better target distribution representation than the baseline RND algorithm. Finally, SVD-RND combined with geometric transform achieves near-perfect detection accuracy on the CelebA dataset.

* ICLR 2020

Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update

May 31, 2018

Su Young Lee, Sungik Choi, Sae-Young Chung

Abstract:We propose Episodic Backward Update - a new algorithm to boost the performance of a deep reinforcement learning agent by a fast reward propagation. In contrast to the conventional use of the experience replay with uniform random sampling, our agent samples a whole episode and successively propagates the value of a state to its previous states. Our computationally efficient recursive algorithm allows sparse and delayed rewards to propagate efficiently through all transitions of a sampled episode. We evaluate our algorithm on 2D MNIST Maze environment and 49 games of the Atari 2600 environment and show that our method improves sample efficiency with a competitive amount of computational cost.
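
The core idea of sampling a whole episode and propagating values backward can be sketched in a tabular setting. This is a simplified illustration under stated assumptions, not the paper's deep reinforcement learning algorithm: the Q-table is a plain dictionary, and the names `episodic_backward_update`, `n_actions`, `gamma`, and `lr` are hypothetical.

```python
from typing import Dict, List, Tuple

# (state, action, reward, next_state, done)
Transition = Tuple[int, int, float, int, bool]
QTable = Dict[Tuple[int, int], float]


def episodic_backward_update(
    q: QTable,
    episode: List[Transition],
    n_actions: int,
    gamma: float = 0.99,
    lr: float = 1.0,
) -> QTable:
    """Apply Q-learning updates to one episode, from the last transition to the first.

    Because later transitions are updated first, a reward at the end of the
    episode reaches the earliest state within a single sweep.
    """
    for state, action, reward, next_state, done in reversed(episode):
        if done:
            target = reward
        else:
            best_next = max(q.get((next_state, a), 0.0) for a in range(n_actions))
            target = reward + gamma * best_next
        old = q.get((state, action), 0.0)
        q[(state, action)] = old + lr * (target - old)
    return q


if __name__ == "__main__":
    # A three-step chain with a single delayed reward at the end.
    chain = [(0, 0, 0.0, 1, False), (1, 0, 0.0, 2, False), (2, 0, 1.0, 3, True)]
    table = episodic_backward_update({}, chain, n_actions=2, gamma=0.5)
    print(table)
```

With uniform random sampling of single transitions, the same delayed reward would typically need many updates to reach the first state, since an early transition sampled before its successor has been updated learns nothing from it.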
