Abstract: Likelihood-free (a.k.a. simulation-based) inference problems are inverse problems with expensive or intractable forward models. ODE inverse problems are commonly treated as likelihood-free, since their forward map has to be numerically approximated by an ODE solver. This, however, is not a fundamental constraint but merely a lack of functionality in classic ODE solvers, which return a point estimate rather than a likelihood. To address this shortcoming, we employ Gaussian ODE filtering (a probabilistic numerical method for ODEs) to construct a local Gaussian approximation to the likelihood. This approximation yields tractable estimators for the gradient and Hessian of the (log-)likelihood. Inserting these estimators into existing gradient-based optimization and sampling methods produces new solvers for ODE inverse problems. We demonstrate that these methods outperform standard likelihood-free approaches on three benchmark systems.
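To make the idea concrete, here is a minimal sketch (our illustration, not the authors' code) of the mechanism the abstract describes: the simplest Gaussian ODE filter (an EKF0 with a once-integrated Wiener-process prior) returns a Gaussian posterior N(m, P) over the ODE solution, so the data likelihood becomes a differentiable Gaussian to which jax.grad applies. The logistic ODE, step sizes, and all names below are assumptions for the demo.

```python
import jax
import jax.numpy as jnp

def neg_log_lik(theta, y0=0.1, h=0.1, steps_per_obs=5,
                y_obs=jnp.array([0.2, 0.45, 0.7]), q=1.0, obs_var=1e-2):
    """Gaussian ODE-filter approximation to -log p(y_obs | theta).

    Example ODE (an assumption): logistic growth y' = theta * y * (1 - y).
    Filter state: x = (y, y'), once-integrated Wiener-process prior.
    """
    f = lambda y: theta * y * (1.0 - y)         # example vector field
    A = jnp.array([[1.0, h], [0.0, 1.0]])       # prior transition over step h
    Q = q * jnp.array([[h**3 / 3, h**2 / 2],
                       [h**2 / 2, h]])          # prior process noise
    m = jnp.array([y0, f(y0)])
    P = jnp.zeros((2, 2))
    nll = 0.0
    for y_k in y_obs:
        for _ in range(steps_per_obs):
            # Predict under the integrated-Wiener-process prior.
            m, P = A @ m, A @ P @ A.T + Q
            # EKF0 update on the zero residual f(y) - y'.
            r = f(m[0]) - m[1]
            S = P[1, 1] + 1e-12
            K = P[:, 1] / S
            m = m + K * r
            P = P - jnp.outer(K, K) * S
        # Local Gaussian likelihood: y_k ~ N(m_y, P_yy + obs_var).
        v = P[0, 0] + obs_var
        nll += 0.5 * ((y_k - m[0]) ** 2 / v + jnp.log(2 * jnp.pi * v))
    return nll

# The gradient estimator is now tractable, enabling gradient-based inversion.
grad_fn = jax.grad(neg_log_lik)
theta = 1.5
for _ in range(100):
    theta -= 0.1 * grad_fn(theta)
```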
Abstract: Transferring learned models to novel tasks is a challenging problem, particularly if only very few labeled examples are available. Although this few-shot learning setup has received a lot of attention recently, most proposed methods focus on discriminating novel classes only. Instead, we consider the extended setup of generalized few-shot learning (GFSL), in which the model must classify over the joint label space consisting of both previously seen and novel classes. We propose a graph-based framework that explicitly models the relationships between all seen and novel classes in the joint label space. Our model, Graph-convolutional Global Prototypical Networks (GcGPN), incorporates these inter-class relations via graph convolution in order to embed novel class representations into the existing space of previously seen classes in a globally consistent manner. Our approach ensures both fast adaptation and global discrimination, the major challenge in GFSL. We demonstrate the benefits of our model on two challenging benchmark datasets.
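A minimal sketch of the core mechanism (our illustration; the adjacency construction, layer sizes, and all names are assumptions, not the GcGPN code): prototypes of seen and novel classes are stacked into one class graph, a graph-convolution step propagates information between related classes, and queries are classified against the refined prototypes of the joint label space.

```python
import jax
import jax.numpy as jnp

def graph_conv(protos, adj, W):
    """One graph-convolution layer over the class graph."""
    a = adj + jnp.eye(adj.shape[0])                 # add self-loops
    d = a.sum(axis=1)
    a_norm = a / jnp.sqrt(d[:, None] * d[None, :])  # D^-1/2 (A+I) D^-1/2
    return jax.nn.relu(a_norm @ protos @ W)

def classify(queries, protos):
    """Nearest-prototype prediction over the joint label space."""
    d2 = ((queries[:, None, :] - protos[None, :, :]) ** 2).sum(-1)
    return jnp.argmin(d2, axis=1)

# Toy setup: 5 seen classes (learned prototypes), 2 novel classes whose
# prototypes are means of their few labeled support embeddings.
key = jax.random.PRNGKey(0)
k1, k2, k3, k4 = jax.random.split(key, 4)
seen = jax.random.normal(k1, (5, 16))
support = jax.random.normal(k2, (2, 5, 16))         # 2 novel classes, 5 shots
novel = support.mean(axis=1)
protos = jnp.concatenate([seen, novel])             # joint label space: (7, 16)
adj = jax.nn.relu(protos @ protos.T)                # similarity-based relations
W = jax.random.normal(k3, (16, 16)) / 4.0
refined = graph_conv(protos, adj, W)                # globally consistent embed
queries = jax.random.normal(k4, (3, 16))
print(classify(queries, refined))                   # labels in {0, ..., 6}
```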
Abstract: State-space models (SSMs) are a highly expressive model class for learning patterns in time series data and for system identification. Deterministic versions of SSMs (e.g. LSTMs) have proved extremely successful in modeling complex time series data. Fully probabilistic SSMs, however, are often hard to train, even on smaller problems. To overcome this limitation, we propose a novel model formulation and a scalable training algorithm based on doubly stochastic variational inference and Gaussian processes. In contrast to existing work, the proposed variational approximation fully captures the temporal correlations of the latent states, which are key to robust training. The effectiveness of the proposed probabilistic recurrent state-space model (PR-SSM) is evaluated on a set of real-world benchmark datasets against state-of-the-art probabilistic model learning methods. Scalability and robustness are demonstrated on a high-dimensional problem.
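A minimal sketch of the sampling step behind this approach (our illustration, not the paper's code; a 1-D latent state and all names are assumptions for brevity): a sparse-GP transition with inducing inputs Z and variational posterior q(u) = N(q_mu, diag(q_s^2)) is unrolled by sampling each latent state from the GP predictive and feeding it back in, so the temporal correlations between latent states are retained rather than moment-matched away.

```python
import jax
import jax.numpy as jnp

def rbf(x, z, ls=1.0, sf=1.0):
    """Squared-exponential kernel for inputs of shape (n, d) and (m, d)."""
    d2 = (((x[:, None, :] - z[None, :, :]) / ls) ** 2).sum(-1)
    return sf**2 * jnp.exp(-0.5 * d2)

def sample_trajectory(key, x0, Z, q_mu, q_s, proc_noise=1e-3, T=50):
    Kzz = rbf(Z, Z) + 1e-6 * jnp.eye(Z.shape[0])
    def step(x, key_t):
        kxz = rbf(x[None, None], Z)[0]              # (M,) cross-covariances
        a = jnp.linalg.solve(Kzz, kxz)              # Kzz^-1 k_zx
        mean = a @ q_mu                             # sparse-GP predictive mean
        var = 1.0 - a @ kxz + (a**2) @ q_s**2       # predictive variance
        var = jnp.maximum(var, 1e-8)
        # Reparameterized sample: the state stays correlated across time.
        x_next = mean + jnp.sqrt(var + proc_noise) * jax.random.normal(key_t)
        return x_next, x_next
    _, xs = jax.lax.scan(step, x0, jax.random.split(key, T))
    return xs                                       # one latent trajectory

# Toy run: 10 inducing points with an arbitrary variational initialization.
key = jax.random.PRNGKey(0)
Z = jnp.linspace(-2.0, 2.0, 10)[:, None]
q_mu = jnp.tanh(Z[:, 0])
q_s = 0.1 * jnp.ones(10)
traj = sample_trajectory(key, jnp.array(0.5), Z, q_mu, q_s)
print(traj.shape)                                   # (50,)
```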