Abstract: Several related works have introduced Koopman-based machine learning architectures as surrogate models for dynamical systems. These architectures aim to learn non-linear measurements (also known as observables) of the system's state that evolve under a linear operator and are therefore amenable to model-based linear control techniques. So far, mainly simple systems have been targeted, and Koopman architectures as reduced-order models for more complex dynamics have not been fully explored. Hence, we use a Koopman-inspired architecture called the Linear Recurrent Autoencoder Network (LRAN) to learn reduced-order dynamics in the convection flows of a Rayleigh-Bénard Convection (RBC) system at different levels of turbulence. The data is obtained from direct numerical simulations of the RBC system. A traditional fluid dynamics method, Kernel Dynamic Mode Decomposition (KDMD), serves as a baseline for comparison with the LRAN. For both methods, we performed hyperparameter sweeps to identify optimal settings. We used a Normalized Sum of Squared Errors measure for the quantitative evaluation of the models, and we also studied the model predictions qualitatively. We obtained more accurate predictions with the LRAN than with KDMD in the most turbulent setting. We conjecture that this is due to the LRAN's flexibility in learning complicated observables from data, allowing it to serve as a viable surrogate model for the main structures of the fluid dynamics in turbulent convection settings. In contrast, KDMD was more effective in lower-turbulence settings due to the repetitiveness of the convection flow. The feasibility of Koopman-based surrogate models for turbulent fluid flows opens possibilities for efficient model-based control techniques useful in a variety of industrial settings.
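The LRAN itself is only named in the abstract; as a hedged illustration, a minimal Koopman-style linear recurrent autoencoder could be sketched as below, assuming PyTorch. All layer sizes, the latent dimension and the toy data are hypothetical stand-ins; the actual LRAN and its training objective may differ.

```python
# Minimal sketch of a Koopman-style linear recurrent autoencoder, assuming
# PyTorch; layer sizes, latent dimension and data are hypothetical stand-ins.
import torch
import torch.nn as nn

class LRAN(nn.Module):
    def __init__(self, state_dim, latent_dim):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Linear(state_dim, 128), nn.ReLU(), nn.Linear(128, latent_dim))
        self.decoder = nn.Sequential(
            nn.Linear(latent_dim, 128), nn.ReLU(), nn.Linear(128, state_dim))
        # Learned linear (Koopman) operator advancing the observables in time.
        self.K = nn.Linear(latent_dim, latent_dim, bias=False)

    def forward(self, x, horizon):
        z = self.encoder(x)                  # observables of the initial state
        preds = []
        for _ in range(horizon):
            z = self.K(z)                    # linear time stepping in latent space
            preds.append(self.decoder(z))    # map observables back to the state
        return torch.stack(preds, dim=1)

model = LRAN(state_dim=64, latent_dim=16)
x0 = torch.randn(8, 64)                      # batch of initial flow states (toy)
x_future = torch.randn(8, 5, 64)             # ground-truth trajectories (toy)
loss = nn.functional.mse_loss(model(x0, horizon=5), x_future)
loss.backward()                              # encoder, decoder and K train jointly
```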
Abstract: Insufficient steel quality in mass production can cause extremely costly damage to tooling, production downtimes and low-quality products. Automatic, fast and cheap strategies to estimate essential material properties for quality control, risk mitigation and the prediction of faults are highly desirable. In this work we analyse a high-throughput production line of steel-based products. Currently, the material quality is checked using manual destructive testing, which is slow, wasteful and covers only a tiny fraction of the material. To achieve complete testing coverage, our industrial collaborator developed a contactless, non-invasive, electromagnetic sensor to measure all material during production in real-time. Our contribution is three-fold: 1) We show in a controlled experiment that the sensor can distinguish steel with deliberately altered properties. 2) 48 steel coils were fully measured non-invasively, and additional destructive tests were conducted on samples to serve as ground truth. A linear model is fitted to the non-invasive measurements to predict two key material properties (yield strength and tensile strength) that are normally obtained by destructive tests. The performance is evaluated in leave-one-coil-out cross-validation. 3) The resulting model is used to analyse the material properties and their relationship with logged product faults on real production data of ~108 km of processed material measured with the non-invasive sensor. The model achieves an excellent performance (F3-score of 0.95) in predicting material running out of specification for the tensile strength. The combination of model predictions and logged product faults shows that if a significant percentage of estimated yield strength values is out of specification, the risk of product faults is high. Our analysis demonstrates promising directions for real-time quality control, risk monitoring and fault detection.
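As a sketch of the evaluation protocol described above (linear model, leave-one-coil-out cross-validation, F3-score), assuming scikit-learn; the synthetic features, coil grouping and specification threshold are hypothetical stand-ins for the real sensor data:

```python
# Sketch of the described protocol with toy data standing in for sensor features.
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import LeaveOneGroupOut
from sklearn.metrics import fbeta_score

rng = np.random.default_rng(0)
X = rng.normal(size=(480, 10))                  # electromagnetic features (toy)
y = X @ rng.normal(size=10) + rng.normal(scale=0.1, size=480)  # e.g. tensile strength
coil_id = np.repeat(np.arange(48), 10)          # 48 coils -> leave-one-coil-out
spec_limit = np.quantile(y, 0.9)                # hypothetical specification bound

y_pred = np.empty_like(y)
for train, test in LeaveOneGroupOut().split(X, y, groups=coil_id):
    y_pred[test] = LinearRegression().fit(X[train], y[train]).predict(X[test])

# F3 weights recall 3x more than precision: missing out-of-spec material is costly.
f3 = fbeta_score(y > spec_limit, y_pred > spec_limit, beta=3)
print(f"F3-score: {f3:.2f}")
```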
Abstract: Proximities are at the heart of almost all machine learning methods. If the input data are given as numerical vectors of equal length, the Euclidean distance or a Hilbertian inner product is frequently used in modeling algorithms. In a more generic view, objects are compared by a (symmetric) similarity or dissimilarity measure, which may not obey particular mathematical properties. This renders many machine learning methods invalid, leading to convergence problems and the loss of guarantees such as generalization bounds. In many cases, the preferred dissimilarity measure is not metric, like the earth mover's distance, or the similarity measure may not be a simple inner product in a Hilbert space but rather an indefinite inner product in its generalization, a Krein space. If the input data are non-vectorial, like text sequences, proximity-based learning or n-gram embedding techniques can be applied. Standard embeddings lead to the desired fixed-length vector encoding but are costly and have substantial limitations in preserving the full information of the original data. As an information-preserving alternative, we propose a complex-valued vector embedding of proximity data. This allows suitable machine learning algorithms to use these fixed-length, complex-valued vectors for further processing. The complex-valued data can serve as input to complex-valued machine learning algorithms. In particular, we address supervised learning and use extensions of prototype-based learning. The proposed approach is evaluated on a variety of standard benchmarks and shows strong performance compared to traditional techniques in processing non-metric or non-PSD proximity data.
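One standard construction consistent with this idea, though not necessarily the paper's exact procedure, embeds a symmetric (possibly non-PSD) similarity matrix via its eigendecomposition, with negative eigenvalues, i.e. the indefinite Krein-space part, mapped to imaginary coordinates:

```python
# Sketch of a complex-valued embedding of a symmetric, possibly non-PSD
# similarity matrix; one standard variant, shown here only for illustration.
import numpy as np

def complex_embedding(S):
    """Embed objects so that the bilinear form x_i^T x_j reproduces S[i, j]."""
    S = 0.5 * (S + S.T)                      # enforce symmetry
    eigvals, eigvecs = np.linalg.eigh(S)
    # The square root of a negative eigenvalue is purely imaginary, so the
    # Krein-space part of S ends up on the imaginary axes of the embedding.
    return eigvecs * np.sqrt(eigvals.astype(complex))

S = np.array([[ 1.0, 0.8, -0.3],
              [ 0.8, 1.0,  0.2],
              [-0.3, 0.2,  1.0]])            # toy non-PSD similarity matrix
X = complex_embedding(S)
print(np.allclose(X @ X.T, S))               # bilinear form recovers S -> True
```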
Abstract: We present a modelling framework for the investigation of supervised learning in non-stationary environments. Specifically, we model two example types of learning systems: prototype-based Learning Vector Quantization (LVQ) for classification and shallow, layered neural networks for regression tasks. We investigate so-called student-teacher scenarios in which the systems are trained from a stream of high-dimensional, labeled data. Properties of the target task are non-stationary due to drift processes occurring while the training is performed. Different types of concept drift are studied, which affect the density of example inputs only, the target rule itself, or both. By applying methods from statistical physics, we develop a modelling framework for the mathematical analysis of the training dynamics in non-stationary environments. Our results show that standard LVQ algorithms are, to a certain extent, already suitable for training in non-stationary environments. However, the application of weight decay as an explicit mechanism of forgetting does not improve the performance under the considered drift processes. Furthermore, we investigate gradient-based training of layered neural networks with sigmoidal activation functions and compare it with the use of rectified linear units (ReLU). Our findings show that the sensitivity to concept drift and the effectiveness of weight decay differ significantly between the two types of activation function.
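As a toy illustration of the setting, not the paper's analytical model, the following sketch trains LVQ1 prototypes from a stream whose class-center direction drifts slowly, with weight decay as the explicit forgetting mechanism; all parameters are hypothetical:

```python
# Toy LVQ1 training on a drifting stream; dimensions and rates are hypothetical.
import numpy as np

rng = np.random.default_rng(1)
d, eta, decay = 100, 0.05, 0.999             # dimension, learning rate, weight decay
b = np.zeros(d); b[0] = 1.0                  # class-center direction (the target)
w = rng.normal(scale=0.01, size=(2, d))      # one prototype per class

for _ in range(10_000):
    b += 0.001 * rng.normal(size=d)          # slow random drift of the target rule
    b /= np.linalg.norm(b)
    y = rng.integers(2)                      # class label of the next stream example
    x = (1 if y == 1 else -1) * b + rng.normal(size=d)   # noisy class cluster
    k = np.argmin(((w - x) ** 2).sum(axis=1))            # winner-takes-all prototype
    s = 1.0 if k == y else -1.0              # LVQ1: attract if correct, repel if not
    w[k] += eta / d * s * (x - w[k])
    w *= decay                               # explicit forgetting via weight decay
```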
Abstract: We study layered neural networks of rectified linear units (ReLU) in a modelling framework for stochastic training processes. The comparison with sigmoidal activation functions is at the center of interest. We compute typical learning curves for shallow networks with K hidden units in matching student-teacher scenarios. The systems exhibit sudden changes of the generalization performance via the process of hidden-unit specialization at critical sizes of the training set. Surprisingly, our results show that the training behavior of ReLU networks is qualitatively different from that of networks with sigmoidal activations. In networks with K >= 3 sigmoidal hidden units, the transition is discontinuous: specialized network configurations co-exist and compete with states of poor performance even for very large training sets. In contrast, the use of ReLU activations results in continuous transitions for all K: for large enough training sets, two competing, differently specialized states display similar generalization abilities, which coincide exactly for large networks in the limit K to infinity.
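A minimal sketch of the matching student-teacher setup, with the generalization error estimated by Monte Carlo rather than computed analytically as in the paper; dimensions, seed and the unspecialized initialization are arbitrary choices:

```python
# Sketch of a matching student-teacher scenario with soft committee machines;
# the generalization error is estimated by Monte Carlo (toy dimensions).
import numpy as np

rng = np.random.default_rng(2)
N, K = 200, 3
g = lambda h: np.maximum(h, 0.0)             # ReLU; swap in np.tanh to compare

B = rng.normal(size=(K, N)) / np.sqrt(N)     # teacher weights
W = rng.normal(size=(K, N)) / np.sqrt(N)     # unspecialized student weights

def output(V, X):
    return g(X @ V.T).sum(axis=1)            # soft committee: sum of hidden units

X = rng.normal(size=(20_000, N))             # large i.i.d. test set
eg = 0.5 * np.mean((output(W, X) - output(B, X)) ** 2)
print(f"estimated generalization error: {eg:.4f}")
```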
Abstract: In recent years, several automatic segmentation methods have been proposed for blood vessels in retinal fundus images, ranging from cheap and fast trainable filters to complicated neural networks and even deep learning. One example of a filter-based segmentation method is B-COSFIRE. In this approach, the image filter is trained on example prototype patterns; the filter becomes selective for these patterns by identifying points of large intensity variation in a Difference-of-Gaussians response, sampled on circles around the filter's center. In this paper we discuss and evaluate several of these vessel segmentation methods. We take a closer look at B-COSFIRE, study its performance experimentally on the recently published IOSTAR dataset, and examine how the parameter values affect the performance. In our experiments we reach a segmentation accuracy of 0.9419. Based on our findings we discuss when B-COSFIRE is the preferred method and in which circumstances it could be beneficial to use a more (computationally) complex segmentation method. We also briefly discuss areas beyond blood vessel segmentation where these methods can be used to segment elongated structures, such as rivers in satellite images or the veins of a leaf.
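A strongly simplified sketch of the B-COSFIRE idea: a rectified Difference-of-Gaussians (DoG) response is sampled at configured offsets around the filter center and the shifted responses are combined by a geometric mean. The automatic configuration step on prototype patterns is omitted, and all parameters here are hypothetical; see the original B-COSFIRE publications for the full method.

```python
# Highly simplified B-COSFIRE-like line detector; parameters are hypothetical.
import numpy as np
from scipy import ndimage

def dog(image, sigma, ratio=0.5):
    """Difference-of-Gaussians: small minus large Gaussian blur."""
    return (ndimage.gaussian_filter(image, ratio * sigma)
            - ndimage.gaussian_filter(image, sigma))

def bcosfire_like(image, offsets, sigma=2.0):
    r = np.maximum(dog(image, sigma), 0.0)   # half-wave rectified DoG response
    # Shift the blurred DoG response so all configured points align at the center.
    shifted = [ndimage.shift(ndimage.gaussian_filter(r, 1.0), (-dy, -dx))
               for dy, dx in offsets]
    return np.prod(shifted, axis=0) ** (1.0 / len(offsets))  # geometric mean

image = np.zeros((64, 64)); image[:, 30:34] = 1.0   # toy vertical "vessel"
offsets = [(-4, 0), (0, 0), (4, 0)]                 # collinear points -> line detector
response = bcosfire_like(image, offsets)
```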
Abstract: We introduce the exact macroscopic on-line learning dynamics of two-layer neural networks with ReLU units in the form of a system of differential equations, using techniques borrowed from statistical physics. In first experiments, the numerical solutions reveal behavior similar to that of the sigmoidal activations studied in earlier work, and the theoretical results show good correspondence with simulations. In over-realizable and unrealizable learning scenarios, however, the learning behavior of ReLU networks shows characteristics distinct from those of sigmoidal networks.
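The differential equations themselves depend on system-specific order-parameter integrals and are not reproduced here; as a hedged companion, the sketch below runs the microscopic on-line learning process that the macroscopic ODEs describe, tracking the order parameters R (student-teacher overlaps) and Q (student-student overlaps). Sizes and the learning rate are hypothetical.

```python
# On-line learning in a ReLU student-teacher setup: one fresh example per
# gradient step; R and Q are the macroscopic quantities the ODEs evolve.
import numpy as np

rng = np.random.default_rng(3)
N, K, eta = 1000, 2, 0.5
g, dg = lambda h: np.maximum(h, 0), lambda h: (h > 0).astype(float)   # ReLU

B = rng.normal(size=(K, N)) / np.sqrt(N)     # teacher (matching K: realizable)
W = rng.normal(size=(K, N)) / np.sqrt(N)     # student

for _ in range(20 * N):                      # ~20 examples per input dimension
    x = rng.normal(size=N)                   # one fresh high-dimensional example
    err = g(W @ x).sum() - g(B @ x).sum()    # soft-committee output difference
    W -= (eta / N) * err * dg(W @ x)[:, None] * x[None, :]   # on-line SGD step

R, Q = W @ B.T, W @ W.T                      # macroscopic order parameters
print(np.round(R, 3), np.round(Q, 3), sep="\n")
```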