Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Michael U. Gutmann

Learning Energy-Based Models from Stochastic Interpolants using Spatiotemporal Differences

May 26, 2026

Hanlin Yu, RuiKang OuYang, Partha Kaushik, Arto Klami, Michael U. Gutmann, Omar Chehab

Abstract:Learning an energy-based model from data samples is a central problem in machine learning. Many recent and popular methods, such as denoising score matching for training energy-based diffusion models, use stochastic interpolants to corrupt data samples at different noise levels indexed by a time variable. This defines a joint density over both the data space and time, and most methods learn its energy through either spatial or temporal differences. We identify distinct failure modes for both of these approaches. To solve them, we propose Spatiotemporal Noise-Contrastive Estimation (stNCE), a framework for learning the energy through joint spatiotemporal differences. stNCE unifies many existing methods and leads to new training objectives. Experiments on images and molecules demonstrate performance competitive with state-of-the-art density estimation methods.

Via

Access Paper or Ask Questions

Sequential Design of Genetic Circuits Under Uncertainty With Reinforcement Learning

May 07, 2026

Michal Kobiela, Diego A. Oyarzún, Michael U. Gutmann

Abstract:The design of biological systems is hindered by uncertainty arising from both intrinsic stochasticity of biomolecular reactions and variability across laboratory or experimental conditions. In this work, we present a sequential framework to optimize genetic circuits under both forms of uncertainty. By employing simulator models based on differential equations or Markov jump processes alongside a reinforcement learning (RL) policy-based approach, our method suggests experiments that adapt to unknown laboratory conditions while accounting for inherent stochasticity. While previous Bayesian methods address uncertainty through iterative experiment-inference-optimization cycles, they typically require computationally expensive inference and optimization steps after each experimental round, leading to delays. To overcome this bottleneck, we propose an amortized approach trained up-front across a distribution of possible uncertain parameters. This strategy sidesteps the need for explicit parameter inference during the design cycle, enabling immediate, observation-based adaptation. We demonstrate our framework on models for heterologous gene expression and a repressilator circuit, showing that it efficiently handles both molecular noise and cross-laboratory variability.

Via

Access Paper or Ask Questions

CFMI: Flow Matching for Missing Data Imputation

Jun 10, 2025

Vaidotas Simkus, Michael U. Gutmann

Figure 1 for CFMI: Flow Matching for Missing Data Imputation

Figure 2 for CFMI: Flow Matching for Missing Data Imputation

Figure 3 for CFMI: Flow Matching for Missing Data Imputation

Figure 4 for CFMI: Flow Matching for Missing Data Imputation

Abstract:We introduce conditional flow matching for imputation (CFMI), a new general-purpose method to impute missing data. The method combines continuous normalising flows, flow-matching, and shared conditional modelling to deal with intractabilities of traditional multiple imputation. Our comparison with nine classical and state-of-the-art imputation methods on 24 small to moderate-dimensional tabular data sets shows that CFMI matches or outperforms both traditional and modern techniques across a wide range of metrics. Applying the method to zero-shot imputation of time-series data, we find that it matches the accuracy of a related diffusion-based method while outperforming it in terms of computational efficiency. Overall, CFMI performs at least as well as traditional methods on lower-dimensional data while remaining scalable to high-dimensional settings, matching or exceeding the performance of other deep learning-based approaches, making it a go-to imputation method for a wide range of data types and dimensionalities.

Via

Access Paper or Ask Questions

Simulation-based Bayesian inference under model misspecification

Mar 16, 2025

Ryan P. Kelly, David J. Warne, David T. Frazier, David J. Nott, Michael U. Gutmann, Christopher Drovandi

Figure 1 for Simulation-based Bayesian inference under model misspecification

Figure 2 for Simulation-based Bayesian inference under model misspecification

Figure 3 for Simulation-based Bayesian inference under model misspecification

Figure 4 for Simulation-based Bayesian inference under model misspecification

Abstract:Simulation-based Bayesian inference (SBI) methods are widely used for parameter estimation in complex models where evaluating the likelihood is challenging but generating simulations is relatively straightforward. However, these methods commonly assume that the simulation model accurately reflects the true data-generating process, an assumption that is frequently violated in realistic scenarios. In this paper, we focus on the challenges faced by SBI methods under model misspecification. We consolidate recent research aimed at mitigating the effects of misspecification, highlighting three key strategies: i) robust summary statistics, ii) generalised Bayesian inference, and iii) error modelling and adjustment parameters. To illustrate both the vulnerabilities of popular SBI methods and the effectiveness of misspecification-robust alternatives, we present empirical results on an illustrative example.

* 46 pages, 8 figures

Via

Access Paper or Ask Questions

Improving Variational Autoencoder Estimation from Incomplete Data with Mixture Variational Families

Mar 05, 2024

Vaidotas Simkus, Michael U. Gutmann

Figure 1 for Improving Variational Autoencoder Estimation from Incomplete Data with Mixture Variational Families

Figure 2 for Improving Variational Autoencoder Estimation from Incomplete Data with Mixture Variational Families

Figure 3 for Improving Variational Autoencoder Estimation from Incomplete Data with Mixture Variational Families

Figure 4 for Improving Variational Autoencoder Estimation from Incomplete Data with Mixture Variational Families

Abstract:We consider the task of estimating variational autoencoders (VAEs) when the training data is incomplete. We show that missing data increases the complexity of the model's posterior distribution over the latent variables compared to the fully-observed case. The increased complexity may adversely affect the fit of the model due to a mismatch between the variational and model posterior distributions. We introduce two strategies based on (i) finite variational-mixture and (ii) imputation-based variational-mixture distributions to address the increased posterior complexity. Through a comprehensive evaluation of the proposed approaches, we show that variational mixtures are effective at improving the accuracy of VAE estimation from incomplete data.

Via

Access Paper or Ask Questions

Conditional Sampling of Variational Autoencoders via Iterated Approximate Ancestral Sampling

Aug 17, 2023

Vaidotas Simkus, Michael U. Gutmann

Figure 1 for Conditional Sampling of Variational Autoencoders via Iterated Approximate Ancestral Sampling

Figure 2 for Conditional Sampling of Variational Autoencoders via Iterated Approximate Ancestral Sampling

Figure 3 for Conditional Sampling of Variational Autoencoders via Iterated Approximate Ancestral Sampling

Figure 4 for Conditional Sampling of Variational Autoencoders via Iterated Approximate Ancestral Sampling

Abstract:Conditional sampling of variational autoencoders (VAEs) is needed in various applications, such as missing data imputation, but is computationally intractable. A principled choice for asymptotically exact conditional sampling is Metropolis-within-Gibbs (MWG). However, we observe that the tendency of VAEs to learn a structured latent space, a commonly desired property, can cause the MWG sampler to get "stuck" far from the target distribution. This paper mitigates the limitations of MWG: we systematically outline the pitfalls in the context of VAEs, propose two original methods that address these pitfalls, and demonstrate an improved performance of the proposed methods on a set of sampling tasks.

Via

Access Paper or Ask Questions

Designing Optimal Behavioral Experiments Using Machine Learning

May 12, 2023

Simon Valentin, Steven Kleinegesse, Neil R. Bramley, Peggy Seriès, Michael U. Gutmann, Christopher G. Lucas

Figure 1 for Designing Optimal Behavioral Experiments Using Machine Learning

Figure 2 for Designing Optimal Behavioral Experiments Using Machine Learning

Figure 3 for Designing Optimal Behavioral Experiments Using Machine Learning

Figure 4 for Designing Optimal Behavioral Experiments Using Machine Learning

Abstract:Computational models are powerful tools for understanding human cognition and behavior. They let us express our theories clearly and precisely, and offer predictions that can be subtle and often counter-intuitive. However, this same richness and ability to surprise means our scientific intuitions and traditional tools are ill-suited to designing experiments to test and compare these models. To avoid these pitfalls and realize the full potential of computational modeling, we require tools to design experiments that provide clear answers about what models explain human behavior and the auxiliary assumptions those models must make. Bayesian optimal experimental design (BOED) formalizes the search for optimal experimental designs by identifying experiments that are expected to yield informative data. In this work, we provide a tutorial on leveraging recent advances in BOED and machine learning to find optimal experiments for any kind of model that we can simulate data from, and show how by-products of this procedure allow for quick and straightforward evaluation of models and their parameters against real experimental data. As a case study, we consider theories of how people balance exploration and exploitation in multi-armed bandit decision-making tasks. We validate the presented approach using simulations and a real-world experiment. As compared to experimental designs commonly used in the literature, we show that our optimal designs more efficiently determine which of a set of models best account for individual human behavior, and more efficiently characterize behavior given a preferred model. We provide code to replicate all analyses as well as tutorial notebooks and pointers to adapt the methodology to other experimental settings.

* Under review

Via

Access Paper or Ask Questions

Estimating the Density Ratio between Distributions with High Discrepancy using Multinomial Logistic Regression

May 01, 2023

Akash Srivastava, Seungwook Han, Kai Xu, Benjamin Rhodes, Michael U. Gutmann

Figure 1 for Estimating the Density Ratio between Distributions with High Discrepancy using Multinomial Logistic Regression

Figure 2 for Estimating the Density Ratio between Distributions with High Discrepancy using Multinomial Logistic Regression

Figure 3 for Estimating the Density Ratio between Distributions with High Discrepancy using Multinomial Logistic Regression

Figure 4 for Estimating the Density Ratio between Distributions with High Discrepancy using Multinomial Logistic Regression

Abstract:Functions of the ratio of the densities $p/q$ are widely used in machine learning to quantify the discrepancy between the two distributions $p$ and $q$. For high-dimensional distributions, binary classification-based density ratio estimators have shown great promise. However, when densities are well separated, estimating the density ratio with a binary classifier is challenging. In this work, we show that the state-of-the-art density ratio estimators perform poorly on well-separated cases and demonstrate that this is due to distribution shifts between training and evaluation time. We present an alternative method that leverages multi-class classification for density ratio estimation and does not suffer from distribution shift issues. The method uses a set of auxiliary densities $\{m_k\}_{k=1}^K$ and trains a multi-class logistic regression to classify the samples from $p, q$, and $\{m_k\}_{k=1}^K$ into $K+2$ classes. We show that if these auxiliary densities are constructed such that they overlap with $p$ and $q$, then a multi-class logistic regression allows for estimating $\log p/q$ on the domain of any of the $K+2$ distributions and resolves the distribution shift problems of the current state-of-the-art methods. We compare our method to state-of-the-art density ratio estimators on both synthetic and real datasets and demonstrate its superior performance on the tasks of density ratio estimation, mutual information estimation, and representation learning. Code: https://www.blackswhan.com/mdre/

* TMLR 2023

Via

Access Paper or Ask Questions

Bayesian Optimization with Informative Covariance

Aug 04, 2022

Afonso Eduardo, Michael U. Gutmann

Figure 1 for Bayesian Optimization with Informative Covariance

Figure 2 for Bayesian Optimization with Informative Covariance

Figure 3 for Bayesian Optimization with Informative Covariance

Figure 4 for Bayesian Optimization with Informative Covariance

Abstract:Bayesian Optimization is a methodology for global optimization of unknown and expensive objectives. It combines a surrogate Bayesian regression model with an acquisition function to decide where to evaluate the objective. Typical regression models are Gaussian processes with stationary covariance functions, which, however, are unable to express prior input-dependent information, in particular information about possible locations of the optimum. The ubiquity of stationary models has led to the common practice of exploiting prior information via informative mean functions. In this paper, we highlight that these models can lead to poor performance, especially in high dimensions. We propose novel informative covariance functions that leverage nonstationarity to encode preferences for certain regions of the search space and adaptively promote local exploration during the optimization. We demonstrate that they can increase the sample efficiency of the optimization in high dimensions, even under weak prior information.

Via

Access Paper or Ask Questions

Pen and Paper Exercises in Machine Learning

Jun 27, 2022

Michael U. Gutmann

Figure 1 for Pen and Paper Exercises in Machine Learning

Figure 2 for Pen and Paper Exercises in Machine Learning

Figure 3 for Pen and Paper Exercises in Machine Learning

Figure 4 for Pen and Paper Exercises in Machine Learning

Abstract:This is a collection of (mostly) pen-and-paper exercises in machine learning. The exercises are on the following topics: linear algebra, optimisation, directed graphical models, undirected graphical models, expressive power of graphical models, factor graphs and message passing, inference for hidden Markov models, model-based learning (including ICA and unnormalised models), sampling and Monte-Carlo integration, and variational inference.

* The associated github page is https://github.com/michaelgutmann/ml-pen-and-paper-exercises

Via

Access Paper or Ask Questions