Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ingmar Schuster

Autoregressive Denoising Diffusion Models for Multivariate Probabilistic Time Series Forecasting

Feb 02, 2021

Kashif Rasul, Calvin Seward, Ingmar Schuster, Roland Vollgraf

Figure 1 for Autoregressive Denoising Diffusion Models for Multivariate Probabilistic Time Series Forecasting

Figure 2 for Autoregressive Denoising Diffusion Models for Multivariate Probabilistic Time Series Forecasting

Figure 3 for Autoregressive Denoising Diffusion Models for Multivariate Probabilistic Time Series Forecasting

Figure 4 for Autoregressive Denoising Diffusion Models for Multivariate Probabilistic Time Series Forecasting

Abstract:In this work, we propose \texttt{TimeGrad}, an autoregressive model for multivariate probabilistic time series forecasting which samples from the data distribution at each time step by estimating its gradient. To this end, we use diffusion probabilistic models, a class of latent variable models closely connected to score matching and energy-based methods. Our model learns gradients by optimizing a variational bound on the data likelihood and at inference time converts white noise into a sample of the distribution of interest through a Markov chain using Langevin sampling. We demonstrate experimentally that the proposed autoregressive denoising diffusion model is the new state-of-the-art multivariate probabilistic forecasting method on real-world data sets with thousands of correlated dimensions. We hope that this method is a useful tool for practitioners and lays the foundation for future research in this area.

Via

Access Paper or Ask Questions

Feature space approximation for kernel-based supervised learning

Nov 25, 2020

Patrick Gelß, Stefan Klus, Ingmar Schuster, Christof Schütte

Figure 1 for Feature space approximation for kernel-based supervised learning

Figure 2 for Feature space approximation for kernel-based supervised learning

Figure 3 for Feature space approximation for kernel-based supervised learning

Figure 4 for Feature space approximation for kernel-based supervised learning

Abstract:We propose a method for the approximation of high- or even infinite-dimensional feature vectors, which play an important role in supervised learning. The goal is to reduce the size of the training data, resulting in lower storage consumption and computational complexity. Furthermore, the method can be regarded as a regularization technique, which improves the generalizability of learned target functions. We demonstrate significant improvements in comparison to the computation of data-driven predictions involving the full training data set. The method is applied to classification and regression problems from different application areas such as image recognition, system identification, and oceanographic time series analysis.

Via

Access Paper or Ask Questions

Multi-variate Probabilistic Time Series Forecasting via Conditioned Normalizing Flows

Feb 14, 2020

Kashif Rasul, Abdul-Saboor Sheikh, Ingmar Schuster, Urs Bergmann, Roland Vollgraf

Figure 1 for Multi-variate Probabilistic Time Series Forecasting via Conditioned Normalizing Flows

Figure 2 for Multi-variate Probabilistic Time Series Forecasting via Conditioned Normalizing Flows

Figure 3 for Multi-variate Probabilistic Time Series Forecasting via Conditioned Normalizing Flows

Figure 4 for Multi-variate Probabilistic Time Series Forecasting via Conditioned Normalizing Flows

Abstract:Time series forecasting is often fundamental to scientific and engineering problems and enables decision making. With ever increasing data set sizes, a trivial solution to scale up predictions is to assume independence between interacting time series. However, modeling statistical dependencies can improve accuracy and enable analysis of interaction effects. Deep learning methods are well suited for this problem, but multi-variate models often assume a simple parametric distribution and do not scale to high dimensions. In this work we model the multi-variate temporal dynamics of time series via an autoregressive deep learning model, where the data distribution is represented by a conditioned normalizing flow. This combination retains the power of autoregressive models, such as good performance in extrapolation into the future, with the flexibility of flows as a general purpose high-dimensional distribution model, while remaining computationally tractable. We show that it improves over the state-of-the-art for standard metrics on many real-world data sets with several thousand interacting time-series.

Via

Access Paper or Ask Questions

A Rigorous Theory of Conditional Mean Embeddings

Dec 16, 2019

Ilja Klebanov, Ingmar Schuster, T. J. Sullivan

Figure 1 for A Rigorous Theory of Conditional Mean Embeddings

Figure 2 for A Rigorous Theory of Conditional Mean Embeddings

Abstract:Conditional mean embeddings (CME) have proven themselves to be a powerful tool in many machine learning applications. They allow the efficient conditioning of probability distributions within the corresponding reproducing kernel Hilbert spaces (RKHSs) by providing a linear-algebraic relation for the kernel mean embeddings of the respective probability distributions. Both centered and uncentered covariance operators have been used to define CMEs in the existing literature. In this paper, we develop a mathematically rigorous theory for both variants, discuss the merits and problems of either, and significantly weaken the conditions for applicability of CMEs. In the course of this, we demonstrate a beautiful connection to Gaussian conditioning in Hilbert spaces.

Via

Access Paper or Ask Questions

Set Flow: A Permutation Invariant Normalizing Flow

Sep 06, 2019

Kashif Rasul, Ingmar Schuster, Roland Vollgraf, Urs Bergmann

Figure 1 for Set Flow: A Permutation Invariant Normalizing Flow

Figure 2 for Set Flow: A Permutation Invariant Normalizing Flow

Figure 3 for Set Flow: A Permutation Invariant Normalizing Flow

Figure 4 for Set Flow: A Permutation Invariant Normalizing Flow

Abstract:We present a generative model that is defined on finite sets of exchangeable, potentially high dimensional, data. As the architecture is an extension of RealNVPs, it inherits all its favorable properties, such as being invertible and allowing for exact log-likelihood evaluation. We show that this architecture is able to learn finite non-i.i.d. set data distributions, learn statistical dependencies between entities of the set and is able to train and sample with variable set sizes in a computationally efficient manner. Experiments on 3D point clouds show state-of-the art likelihoods.

Via

Access Paper or Ask Questions

Kernel Conditional Density Operators

May 27, 2019

Ingmar Schuster, Mattes Mollenhauer, Stefan Klus, Krikamol Muandet

Figure 1 for Kernel Conditional Density Operators

Figure 2 for Kernel Conditional Density Operators

Figure 3 for Kernel Conditional Density Operators

Figure 4 for Kernel Conditional Density Operators

Abstract:We introduce a conditional density estimation model termed the conditional density operator. It naturally captures multivariate, multimodal output densities and is competitive with recent neural conditional density models and Gaussian processes. To derive the model, we propose a novel approach to the reconstruction of probability densities from their kernel mean embeddings by drawing connections to estimation of Radon-Nikodym derivatives in the reproducing kernel Hilbert space (RKHS). We prove finite sample error bounds which are independent of problem dimensionality. Furthermore, the resulting conditional density model is applied to real-world data and we demonstrate its versatility and competitive performance.

Via

Access Paper or Ask Questions

A kernel-based approach to molecular conformation analysis

Sep 28, 2018

Stefan Klus, Andreas Bittracher, Ingmar Schuster, Christof Schütte

Figure 1 for A kernel-based approach to molecular conformation analysis

Figure 2 for A kernel-based approach to molecular conformation analysis

Figure 3 for A kernel-based approach to molecular conformation analysis

Figure 4 for A kernel-based approach to molecular conformation analysis

Abstract:We present a novel machine learning approach to understanding conformation dynamics of biomolecules. The approach combines kernel-based techniques that are popular in the machine learning community with transfer operator theory for analyzing dynamical systems in order to identify conformation dynamics based on molecular dynamics simulation data. We show that many of the prominent methods like Markov State Models, EDMD, and TICA can be regarded as special cases of this approach and that new efficient algorithms can be constructed based on this derivation. The results of these new powerful methods will be illustrated with several examples, in particular the alanine dipeptide and the protein NTL9.

Via

Access Paper or Ask Questions

Singular Value Decomposition of Operators on Reproducing Kernel Hilbert Spaces

Jul 24, 2018

Mattes Mollenhauer, Ingmar Schuster, Stefan Klus, Christof Schütte

Figure 1 for Singular Value Decomposition of Operators on Reproducing Kernel Hilbert Spaces

Figure 2 for Singular Value Decomposition of Operators on Reproducing Kernel Hilbert Spaces

Figure 3 for Singular Value Decomposition of Operators on Reproducing Kernel Hilbert Spaces

Abstract:Reproducing kernel Hilbert spaces (RKHSs) play an important role in many statistics and machine learning applications ranging from support vector machines to Gaussian processes and kernel embeddings of distributions. Operators acting on such spaces are, for instance, required to embed conditional probability distributions in order to implement the kernel Bayes rule and build sequential data models. It was recently shown that transfer operators such as the Perron-Frobenius or Koopman operator can also be approximated in a similar fashion using covariance and cross-covariance operators and that eigenfunctions of these operators can be obtained by solving associated matrix eigenvalue problems. The goal of this paper is to provide a solid functional analytic foundation for the eigenvalue decomposition of RKHS operators and to extend the approach to the singular value decomposition. The results are illustrated with simple guiding examples.

Via

Access Paper or Ask Questions

Markov Chain Importance Sampling - a highly efficient estimator for MCMC

May 23, 2018

Ingmar Schuster, Ilja Klebanov

Figure 1 for Markov Chain Importance Sampling - a highly efficient estimator for MCMC

Figure 2 for Markov Chain Importance Sampling - a highly efficient estimator for MCMC

Figure 3 for Markov Chain Importance Sampling - a highly efficient estimator for MCMC

Figure 4 for Markov Chain Importance Sampling - a highly efficient estimator for MCMC

Abstract:Markov chain algorithms are ubiquitous in machine learning and statistics and many other disciplines. In this work we present a novel estimator applicable to several classes of Markov chains, dubbed Markov chain importance sampling (MCIS). For a broad class of Metropolis-Hastings algorithms, MCIS efficiently makes use of rejected proposals. For discretized Langevin diffusions, it provides a novel way of correcting the discretization error. Our estimator satisfies a central limit theorem and improves on error per CPU cycle, often to a large extent. As a by-product it enables estimating the normalizing constant, an important quantity in Bayesian machine learning and statistics.

Via

Access Paper or Ask Questions

Eigendecompositions of Transfer Operators in Reproducing Kernel Hilbert Spaces

May 16, 2018

Stefan Klus, Ingmar Schuster, Krikamol Muandet

Figure 1 for Eigendecompositions of Transfer Operators in Reproducing Kernel Hilbert Spaces

Figure 2 for Eigendecompositions of Transfer Operators in Reproducing Kernel Hilbert Spaces

Figure 3 for Eigendecompositions of Transfer Operators in Reproducing Kernel Hilbert Spaces

Figure 4 for Eigendecompositions of Transfer Operators in Reproducing Kernel Hilbert Spaces

Abstract:Transfer operators such as the Perron--Frobenius or Koopman operator play an important role in the global analysis of complex dynamical systems. The eigenfunctions of these operators can be used to detect metastable sets, to project the dynamics onto the dominant slow processes, or to separate superimposed signals. We extend transfer operator theory to reproducing kernel Hilbert spaces and show that these operators are related to Hilbert space representations of conditional distributions, known as conditional mean embeddings in the machine learning community. Moreover, numerical methods to compute empirical estimates of these embeddings are akin to data-driven methods for the approximation of transfer operators such as extended dynamic mode decomposition and its variants. One main benefit of the presented kernel-based approaches is that these methods can be applied to any domain where a similarity measure given by a kernel is available. We illustrate the results with the aid of guiding examples and highlight potential applications in molecular dynamics as well as video and text data analysis.

Via

Access Paper or Ask Questions