Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Wouter Kouw

Expected Free Energy-based Planning as Variational Inference

Apr 21, 2025

Bert de Vries, Wouter Nuijten, Thijs van de Laar, Wouter Kouw, Sepideh Adamiat, Tim Nisslbeck, Mykola Lukashchuk, Hoang Minh Huu Nguyen, Marco Hidalgo Araya, Raphael Tresor(+6 more)

Abstract:We address the problem of planning under uncertainty, where an agent must choose actions that not only achieve desired outcomes but also reduce uncertainty. Traditional methods often treat exploration and exploitation as separate objectives, lacking a unified inferential foundation. Active inference, grounded in the Free Energy Principle, offers such a foundation by minimizing Expected Free Energy (EFE), a cost function that combines utility with epistemic drives like ambiguity resolution and novelty seeking. However, the computational burden of EFE minimization has remained a major obstacle to its scalability. In this paper, we show that EFE-based planning arises naturally from minimizing a variational free energy functional on a generative model augmented with preference and epistemic priors. This result reinforces theoretical consistency with the Free Energy Principle, by casting planning itself as variational inference. Our formulation yields optimal policies that jointly support goal achievement and information gain, while incorporating a complexity term that accounts for bounded computational resources. This unifying framework connects and extends existing methods, enabling scalable, resource-aware implementations of active inference agents.

* 16 pages

Via

Access Paper or Ask Questions

Variational message passing for online polynomial NARMAX identification

Apr 02, 2022

Wouter Kouw, Albert Podusenko, Magnus Koudahl, Maarten Schoukens

Figure 1 for Variational message passing for online polynomial NARMAX identification

Figure 2 for Variational message passing for online polynomial NARMAX identification

Figure 3 for Variational message passing for online polynomial NARMAX identification

Figure 4 for Variational message passing for online polynomial NARMAX identification

Abstract:We propose a variational Bayesian inference procedure for online nonlinear system identification. For each output observation, a set of parameter posterior distributions is updated, which is then used to form a posterior predictive distribution for future outputs. We focus on the class of polynomial NARMAX models, which we cast into probabilistic form and represent in terms of a Forney-style factor graph. Inference in this graph is efficiently performed by a variational message passing algorithm. We show empirically that our variational Bayesian estimator outperforms an online recursive least-squares estimator, most notably in small sample size settings and low noise regimes, and performs on par with an iterative least-squares estimator trained offline.

* 6 pages, 4 figures. Accepted to the American Control Conference 2022

Via

Access Paper or Ask Questions

Back to the Future -- Sequential Alignment of Text Representations

Sep 10, 2019

Johannes Bjerva, Wouter Kouw, Isabelle Augenstein

Figure 1 for Back to the Future -- Sequential Alignment of Text Representations

Figure 2 for Back to the Future -- Sequential Alignment of Text Representations

Figure 3 for Back to the Future -- Sequential Alignment of Text Representations

Figure 4 for Back to the Future -- Sequential Alignment of Text Representations

Abstract:Language evolves over time in many ways relevant to natural language processing tasks. For example, recent occurrences of tokens 'BERT' and 'ELMO' in publications refer to neural network architectures rather than persons. This type of temporal signal is typically overlooked, but is important if one aims to deploy a machine learning model over an extended period of time. In particular, language evolution causes data drift between time-steps in sequential decision-making tasks. Examples of such tasks include prediction of paper acceptance for yearly conferences (regular intervals) or author stance prediction for rumours on Twitter (irregular intervals). Inspired by successes in computer vision, we tackle data drift by sequentially aligning learned representations. We evaluate on three challenging tasks varying in terms of time-scales, linguistic units, and domains. These tasks show our method outperforming several strong baselines, including using all available data. We argue that, due to its low computational expense, sequential alignment is a practical solution to dealing with language evolution.

Via

Access Paper or Ask Questions

On reducing sampling variance in covariate shift using control variates

Oct 17, 2017

Wouter Kouw, Marco Loog

Figure 1 for On reducing sampling variance in covariate shift using control variates

Figure 2 for On reducing sampling variance in covariate shift using control variates

Figure 3 for On reducing sampling variance in covariate shift using control variates

Figure 4 for On reducing sampling variance in covariate shift using control variates

Abstract:Covariate shift classification problems can in principle be tackled by importance-weighting training samples. However, the sampling variance of the risk estimator is often scaled up dramatically by the weights. This means that during cross-validation - when the importance-weighted risk is repeatedly evaluated - suboptimal hyperparameter estimates are produced. We study the sampling variances of the importance-weighted versus the oracle estimator as a function of the relative scale of the training data. We show that introducing a control variate can reduce the variance of the importance-weighted risk estimator, which leads to superior regularization parameter estimates when the training data is much smaller in scale than the test data.

* Submitted to the journal Pattern Recognition Letters

Via

Access Paper or Ask Questions