Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Markus Heinonen

Devil is in the Details: Density Guidance for Detail-Aware Generation with Flow Models

Feb 09, 2025

Rafał Karczewski, Markus Heinonen, Vikas Garg

Abstract:Diffusion models have emerged as a powerful class of generative models, capable of producing high-quality images by mapping noise to a data distribution. However, recent findings suggest that image likelihood does not align with perceptual quality: high-likelihood samples tend to be smooth, while lower-likelihood ones are more detailed. Controlling sample density is thus crucial for balancing realism and detail. In this paper, we analyze an existing technique, Prior Guidance, which scales the latent code to influence image detail. We introduce score alignment, a condition that explains why this method works and show that it can be tractably checked for any continuous normalizing flow model. We then propose Density Guidance, a principled modification of the generative ODE that enables exact log-density control during sampling. Finally, we extend Density Guidance to stochastic sampling, ensuring precise log-density control while allowing controlled variation in structure or fine details. Our experiments demonstrate that these techniques provide fine-grained control over image detail without compromising sample quality.

* 27 pages, 15 figures

Via

Access Paper or Ask Questions

Diffusion Models as Cartoonists! The Curious Case of High Density Regions

Nov 02, 2024

Rafał Karczewski, Markus Heinonen, Vikas Garg

Abstract:We investigate what kind of images lie in the high-density regions of diffusion models. We introduce a theoretical mode-tracking process capable of pinpointing the exact mode of the denoising distribution, and we propose a practical high-probability sampler that consistently generates images of higher likelihood than usual samplers. Our empirical findings reveal the existence of significantly higher likelihood samples that typical samplers do not produce, often manifesting as cartoon-like drawings or blurry images depending on the noise level. Curiously, these patterns emerge in datasets devoid of such examples. We also present a novel approach to track sample likelihoods in diffusion SDEs, which remarkably incurs no additional computational cost.

Via

Access Paper or Ask Questions

Free Hunch: Denoiser Covariance Estimation for Diffusion Models Without Extra Costs

Oct 15, 2024

Severi Rissanen, Markus Heinonen, Arno Solin

Abstract:The covariance for clean data given a noisy observation is an important quantity in many conditional generation methods for diffusion models. Current methods require heavy test-time computation, altering the standard diffusion training process or denoiser architecture, or making heavy approximations. We propose a new framework that sidesteps these issues by using covariance information that is available for free from training data and the curvature of the generative trajectory, which is linked to the covariance through the second-order Tweedie's formula. We integrate these sources of information using {\em (i)} a novel method to transfer covariance estimates across noise levels and (ii) low-rank updates in a given noise level. We validate the method on linear inverse problems, where it outperforms recent baselines, especially with fewer diffusion steps.

* 24 pages, 11 figures

Via

Access Paper or Ask Questions

What Ails Generative Structure-based Drug Design: Too Little or Too Much Expressivity?

Aug 12, 2024

Rafał Karczewski, Samuel Kaski, Markus Heinonen, Vikas Garg

Figure 1 for What Ails Generative Structure-based Drug Design: Too Little or Too Much Expressivity?

Figure 2 for What Ails Generative Structure-based Drug Design: Too Little or Too Much Expressivity?

Figure 3 for What Ails Generative Structure-based Drug Design: Too Little or Too Much Expressivity?

Figure 4 for What Ails Generative Structure-based Drug Design: Too Little or Too Much Expressivity?

Abstract:Several generative models with elaborate training and sampling procedures have been proposed recently to accelerate structure-based drug design (SBDD); however, perplexingly, their empirical performance turns out to be suboptimal. We seek to better understand this phenomenon from both theoretical and empirical perspectives. Since most of these models apply graph neural networks (GNNs), one may suspect that they inherit the representational limitations of GNNs. We analyze this aspect, establishing the first such results for protein-ligand complexes. A plausible counterview may attribute the underperformance of these models to their excessive parameterizations, inducing expressivity at the expense of generalization. We also investigate this possibility with a simple metric-aware approach that learns an economical surrogate for affinity to infer an unlabelled molecular graph and optimizes for labels conditioned on this graph and molecular properties. The resulting model achieves state-of-the-art results using 100x fewer trainable parameters and affords up to 1000x speedup. Collectively, our findings underscore the need to reassess and redirect the existing paradigm and efforts for SBDD.

* 25 pages, 11 figures

Via

Access Paper or Ask Questions

Improving robustness to corruptions with multiplicative weight perturbations

Jun 24, 2024

Trung Trinh, Markus Heinonen, Luigi Acerbi, Samuel Kaski

Figure 1 for Improving robustness to corruptions with multiplicative weight perturbations

Figure 2 for Improving robustness to corruptions with multiplicative weight perturbations

Figure 3 for Improving robustness to corruptions with multiplicative weight perturbations

Figure 4 for Improving robustness to corruptions with multiplicative weight perturbations

Abstract:Deep neural networks (DNNs) excel on clean images but struggle with corrupted ones. Incorporating specific corruptions into the data augmentation pipeline can improve robustness to those corruptions but may harm performance on clean images and other types of distortion. In this paper, we introduce an alternative approach that improves the robustness of DNNs to a wide range of corruptions without compromising accuracy on clean images. We first demonstrate that input perturbations can be mimicked by multiplicative perturbations in the weight space. Leveraging this, we propose Data Augmentation via Multiplicative Perturbation (DAMP), a training method that optimizes DNNs under random multiplicative weight perturbations. We also examine the recently proposed Adaptive Sharpness-Aware Minimization (ASAM) and show that it optimizes DNNs under adversarial multiplicative weight perturbations. Experiments on image classification datasets (CIFAR-10/100, TinyImageNet and ImageNet) and neural network architectures (ResNet50, ViT-S/16) show that DAMP enhances model generalization performance in the presence of corruptions across different settings. Notably, DAMP is able to train a ViT-S/16 on ImageNet from scratch, reaching the top-1 error of 23.7% which is comparable to ResNet50 without extensive data augmentations.

* Under review

Via

Access Paper or Ask Questions

Robust Classification by Coupling Data Mollification with Label Smoothing

Jun 03, 2024

Markus Heinonen, Ba-Hien Tran, Michael Kampffmeyer, Maurizio Filippone

Figure 1 for Robust Classification by Coupling Data Mollification with Label Smoothing

Figure 2 for Robust Classification by Coupling Data Mollification with Label Smoothing

Figure 3 for Robust Classification by Coupling Data Mollification with Label Smoothing

Figure 4 for Robust Classification by Coupling Data Mollification with Label Smoothing

Abstract:Introducing training-time augmentations is a key technique to enhance generalization and prepare deep neural networks against test-time corruptions. Inspired by the success of generative diffusion models, we propose a novel approach coupling data augmentation, in the form of image noising and blurring, with label smoothing to align predicted label confidences with image degradation. The method is simple to implement, introduces negligible overheads, and can be combined with existing augmentations. We demonstrate improved robustness and uncertainty quantification on the corrupted image benchmarks of the CIFAR and TinyImageNet datasets.

* Under review

Via

Access Paper or Ask Questions

Improving Discrete Diffusion Models via Structured Preferential Generation

May 28, 2024

Severi Rissanen, Markus Heinonen, Arno Solin

Abstract:In the domains of image and audio, diffusion models have shown impressive performance. However, their application to discrete data types, such as language, has often been suboptimal compared to autoregressive generative models. This paper tackles the challenge of improving discrete diffusion models by introducing a structured forward process that leverages the inherent information hierarchy in discrete categories, such as words in text. Our approach biases the generative process to produce certain categories before others, resulting in a notable improvement in log-likelihood scores on the text8 dataset. This work paves the way for more advances in discrete diffusion models with potentially significant enhancements in performance.

* 10 pages, 7 figures

Via

Access Paper or Ask Questions

Alignment is Key for Applying Diffusion Models to Retrosynthesis

May 27, 2024

Najwa Laabid, Severi Rissanen, Markus Heinonen, Arno Solin, Vikas Garg

Abstract:Retrosynthesis, the task of identifying precursors for a given molecule, can be naturally framed as a conditional graph generation task. Diffusion models are a particularly promising modelling approach, enabling post-hoc conditioning and trading off quality for speed during generation. We show mathematically that permutation equivariant denoisers severely limit the expressiveness of graph diffusion models and thus their adaptation to retrosynthesis. To address this limitation, we relax the equivariance requirement such that it only applies to aligned permutations of the conditioning and the generated graphs obtained through atom mapping. Our new denoiser achieves the highest top-$1$ accuracy ($54.7$\%) across template-free and template-based methods on USPTO-50k. We also demonstrate the ability for flexible post-training conditioning and good sample quality with small diffusion step counts, highlighting the potential for interactive applications and additional controls for multi-step planning.

* 28 pages, 9 figures

Via

Access Paper or Ask Questions

ClimODE: Climate and Weather Forecasting with Physics-informed Neural ODEs

Apr 15, 2024

Yogesh Verma, Markus Heinonen, Vikas Garg

Abstract:Climate and weather prediction traditionally relies on complex numerical simulations of atmospheric physics. Deep learning approaches, such as transformers, have recently challenged the simulation paradigm with complex network forecasts. However, they often act as data-driven black-box models that neglect the underlying physics and lack uncertainty quantification. We address these limitations with ClimODE, a spatiotemporal continuous-time process that implements a key principle of advection from statistical mechanics, namely, weather changes due to a spatial movement of quantities over time. ClimODE models precise weather evolution with value-conserving dynamics, learning global weather transport as a neural flow, which also enables estimating the uncertainty in predictions. Our approach outperforms existing data-driven methods in global and regional forecasting with an order of magnitude smaller parameterization, establishing a new state of the art.

* Accepted as ICLR 2024 Oral. Project website: https://yogeshverma1998.github.io/ClimODE/

Via

Access Paper or Ask Questions

Field-based Molecule Generation

Feb 24, 2024

Alexandru Dumitrescu, Dani Korpela, Markus Heinonen, Yogesh Verma, Valerii Iakovlev, Vikas Garg, Harri Lähdesmäki

Figure 1 for Field-based Molecule Generation

Figure 2 for Field-based Molecule Generation

Figure 3 for Field-based Molecule Generation

Figure 4 for Field-based Molecule Generation

Abstract:This work introduces FMG, a field-based model for drug-like molecule generation. We show how the flexibility of this method provides crucial advantages over the prevalent, point-cloud based methods, and achieves competitive molecular stability generation. We tackle optical isomerism (enantiomers), a previously omitted molecular property that is crucial for drug safety and effectiveness, and thus account for all molecular geometry aspects. We demonstrate how previous methods are invariant to a group of transformations that includes enantiomer pairs, leading them invariant to the molecular R and S configurations, while our field-based generative model captures this property.

* 15 pages, 14 figures

Via

Access Paper or Ask Questions