Niklas Wahlström

Repulsive Ensembles for Bayesian Inference in Physics-informed Neural Networks

May 22, 2025

Philipp Pilar, Markus Heinonen, Niklas Wahlström

Abstract:Physics-informed neural networks (PINNs) have proven an effective tool for solving differential equations, in particular when considering non-standard or ill-posed settings. When inferring solutions and parameters of the differential equation from data, uncertainty estimates are preferable to point estimates, as they give an idea about the accuracy of the solution. In this work, we consider the inverse problem and employ repulsive ensembles of PINNs (RE-PINN) for obtaining such estimates. The repulsion is implemented by adding a particular repulsive term to the loss function, which has the property that the ensemble predictions correspond to the true Bayesian posterior in the limit of infinite ensemble members. Where possible, we compare the ensemble predictions to Monte Carlo baselines. Whereas the standard ensemble tends to collapse to maximum-a-posteriori solutions, the repulsive ensemble produces significantly more accurate uncertainty estimates and exhibits higher sample diversity.

Probabilistic matching of real and generated data statistics in generative adversarial networks

Jun 19, 2023

Philipp Pilar, Niklas Wahlström

Abstract:Generative adversarial networks constitute a powerful approach to generative modeling. While generated samples often are indistinguishable from real data, there is no guarantee that they will follow the true data distribution. In this work, we propose a method to ensure that the distributions of certain generated data statistics coincide with the respective distributions of the real data. In order to achieve this, we add a Kullback-Leibler term to the generator loss function: the KL divergence is taken between the true distributions as represented by a conditional energy-based model, and the corresponding generated distributions obtained from minibatch values at each iteration. We evaluate the method on a synthetic dataset and two real-world datasets and demonstrate improved performance of our method.

Invertible Kernel PCA with Random Fourier Features

Mar 09, 2023

Daniel Gedon, Antônio H. Ribeiro, Niklas Wahlström, Thomas B. Schön

Abstract:Kernel principal component analysis (kPCA) is a widely studied method to construct a low-dimensional data representation after a nonlinear transformation. The prevailing method to reconstruct the original input signal from kPCA, an important task for denoising, requires us to solve a supervised learning problem. In this paper, we present an alternative method where the reconstruction follows naturally from the compression step. We first approximate the kernel with random Fourier features. Then, we exploit the fact that the nonlinear transformation is invertible in a certain subdomain. Hence the name invertible kernel PCA (ikPCA). We experiment with different data modalities and show that ikPCA performs similarly to kPCA with supervised reconstruction on denoising tasks, making it a strong alternative.

* This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
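The first step named in the abstract, approximating a kernel with random Fourier features, can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the RBF kernel, the lengthscale, the feature count `D`, and the helper names `rff_features` and `rbf_kernel` are all assumptions chosen for illustration, and the sketch covers only the kernel approximation, not the inversion step that ikPCA relies on.

```python
import numpy as np

def rff_features(X, W, b):
    """Map inputs X of shape (n, d) to random Fourier features of shape (n, D)."""
    D = W.shape[1]
    return np.sqrt(2.0 / D) * np.cos(X @ W + b)

def rbf_kernel(X, Y, lengthscale):
    """Exact RBF kernel exp(-||x - y||^2 / (2 * lengthscale^2))."""
    sq_dists = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-sq_dists / (2.0 * lengthscale**2))

rng = np.random.default_rng(0)
n, d, D, lengthscale = 20, 3, 5000, 1.0  # illustrative sizes (assumed)

X = rng.normal(size=(n, d))
# Frequencies are drawn from the spectral density of the RBF kernel,
# phases uniformly from [0, 2*pi).
W = rng.normal(scale=1.0 / lengthscale, size=(d, D))
b = rng.uniform(0.0, 2.0 * np.pi, size=D)

Z = rff_features(X, W, b)
K_approx = Z @ Z.T                      # inner products of the features
K_exact = rbf_kernel(X, X, lengthscale)
max_err = np.abs(K_approx - K_exact).max()
print(f"max abs error of the kernel approximation: {max_err:.3f}")
```

With an explicit feature map of this kind, kernel PCA reduces to ordinary PCA on the rows of `Z`, which is presumably what makes a reconstruction through the feature map conceivable in the first place.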

Physics-informed neural networks with unknown measurement noise

Nov 28, 2022

Philipp Pilar, Niklas Wahlström

Abstract:Physics-informed neural networks (PINNs) constitute a flexible approach to both finding solutions and identifying parameters of partial differential equations. Most works on the topic assume noiseless data, or data contaminated by weak Gaussian noise. We show that the standard PINN framework breaks down in case of non-Gaussian noise. We give a way of resolving this fundamental issue and we propose to jointly train an energy-based model (EBM) to learn the correct noise distribution. We illustrate the improved performance of our approach using multiple examples.

Incorporating Sum Constraints into Multitask Gaussian Processes

Feb 03, 2022

Philipp Pilar, Carl Jidling, Thomas B. Schön, Niklas Wahlström

Abstract:Machine learning models can be improved by adapting them to respect existing background knowledge. In this paper we consider multitask Gaussian processes, with background knowledge in the form of constraints that require a specific sum of the outputs to be constant. This is achieved by conditioning the prior distribution on the constraint fulfillment. The approach allows for both linear and nonlinear constraints. We demonstrate that the constraints are fulfilled with high precision and that the construction can improve the overall prediction accuracy as compared to the standard Gaussian process.
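The idea of conditioning a Gaussian prior on a sum constraint can be sketched in finite dimensions. The example below is an illustration under stated assumptions rather than the paper's construction: it takes three outputs at a single input with a joint Gaussian prior, and conditions on a noise-free linear constraint `A f = c` using the standard Gaussian conditioning formulas. The function name `condition_on_linear_constraint` and all numerical values are hypothetical.

```python
import numpy as np

def condition_on_linear_constraint(m, K, A, c):
    """Condition f ~ N(m, K) on the noise-free linear constraint A f = c.

    Returns the mean and covariance of the conditioned Gaussian.
    """
    S = A @ K @ A.T                      # covariance of A f
    G = np.linalg.solve(S, A @ K).T      # gain K A^T S^{-1} (K and S symmetric)
    m_c = m + G @ (c - A @ m)
    K_c = K - G @ A @ K
    return m_c, K_c

rng = np.random.default_rng(1)
B = rng.normal(size=(3, 3))
K = B @ B.T + np.eye(3)                  # a valid (positive definite) prior covariance
m = np.zeros(3)                          # prior mean of the three outputs

A = np.ones((1, 3))                      # constraint: f_1 + f_2 + f_3 = 1
c = np.array([1.0])

m_c, K_c = condition_on_linear_constraint(m, K, A, c)
print("constrained mean sums to:", (A @ m_c).item())
print("variance of the sum after conditioning:", (A @ K_c @ A.T).item())
```

After conditioning, the mean satisfies the constraint exactly and the sum of the outputs has (numerically) zero variance, so every sample from the conditioned distribution fulfils the constraint.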

Learning deep autoregressive models for hierarchical data

May 06, 2021

Carl R. Andersson, Niklas Wahlström, Thomas B. Schön

Abstract:We propose a model for hierarchical structured data as an extension to the stochastic temporal convolutional network. The proposed model combines an autoregressive model with a hierarchical variational autoencoder and downsampling to achieve superior computational complexity. We evaluate the proposed model on two different types of sequential data: speech and handwritten text. The results are promising with the proposed model achieving state-of-the-art performance.

Deep State Space Models for Nonlinear System Identification

Mar 31, 2020

Daniel Gedon, Niklas Wahlström, Thomas B. Schön, Lennart Ljung

Abstract:Deep state space models (SSMs), an actively evolving class of generative temporal models developed in the deep learning community, have a close connection to classic SSMs. In this work, six new deep SSMs are implemented and evaluated for the identification of established nonlinear dynamic system benchmarks. The models and their parameter learning algorithms are elaborated rigorously. Used as black-box identification models, deep SSMs can describe a wide range of dynamics due to the flexibility of deep neural networks. Additionally, the uncertainty of the system is modelled, and therefore one obtains a much richer representation and a whole class of systems to describe the underlying dynamics.

Deep Convolutional Networks in System Identification

Sep 04, 2019

Carl Andersson, Antônio H. Ribeiro, Koen Tiels, Niklas Wahlström, Thomas B. Schön

Abstract:Recent developments within deep learning are relevant for nonlinear system identification problems. In this paper, we establish connections between the deep learning and the system identification communities. It has recently been shown that convolutional architectures are at least as capable as recurrent architectures when it comes to sequence modeling tasks. Inspired by these results, we explore the explicit relationships between the recently proposed temporal convolutional network (TCN) and two classic system identification model structures: Volterra series and block-oriented models. We end the paper with an experimental study where we provide results on two real-world problems, the well-known Silverbox dataset and a newer dataset originating from ground vibration experiments on an F-16 fighter aircraft.

* Accepted to the Conference on Decision and Control. The first two authors contributed equally

Data-Driven Impulse Response Regularization via Deep Learning

Oct 11, 2018

Carl Andersson, Niklas Wahlström, Thomas B. Schön

Abstract:We consider the problem of impulse response estimation of stable linear single-input single-output systems. It is a well-studied problem where flexible non-parametric models recently offered a leap in performance compared to the classical finite-dimensional model structures. Inspired by this development and the success of deep learning we propose a new flexible data-driven model. Our experiments indicate that the new model is capable of exploiting even more of the hidden patterns that are present in the input-output data as compared to the non-parametric models.

Probabilistic approach to limited-data computed tomography reconstruction

Sep 11, 2018

Zenith Purisha, Carl Jidling, Niklas Wahlström, Simo Särkkä, Thomas B. Schön

Abstract:We consider the problem of reconstructing the internal structure of an object from limited x-ray projections. In this work, we use a Gaussian process to model the target function. In contrast to other established methods, this comes with the advantage of not requiring any manual parameter tuning, which is usually needed in classical regularization strategies. Gaussian processes are known to be computationally heavy because of the required inversion of a covariance matrix; in this work, by employing an approximate spectral-based technique, we reduce the computational complexity and avoid the need for numerical integration. Results from simulated and real data indicate that this approach is less sensitive to streak artifacts than the commonly used method of filtered back projection, an analytic reconstruction algorithm based on the Radon inversion formula.
