Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sølve Eidnes

Machine learning in wastewater treatment: insights from modelling a pilot denitrification reactor

Dec 18, 2024

Eivind Bøhn, Sølve Eidnes, Kjell Rune Jonassen

Figure 1 for Machine learning in wastewater treatment: insights from modelling a pilot denitrification reactor

Figure 2 for Machine learning in wastewater treatment: insights from modelling a pilot denitrification reactor

Figure 3 for Machine learning in wastewater treatment: insights from modelling a pilot denitrification reactor

Figure 4 for Machine learning in wastewater treatment: insights from modelling a pilot denitrification reactor

Abstract:Wastewater treatment plants are increasingly recognized as promising candidates for machine learning applications, due to their societal importance and high availability of data. However, their varied designs, operational conditions, and influent characteristics hinder straightforward automation. In this study, we use data from a pilot reactor at the Veas treatment facility in Norway to explore how machine learning can be used to optimize biological nitrate ($\mathrm{NO_3^-}$) reduction to molecular nitrogen ($\mathrm{N_2}$) in the biogeochemical process known as \textit{denitrification}. Rather than focusing solely on predictive accuracy, our approach prioritizes understanding the foundational requirements for effective data-driven modelling of wastewater treatment. Specifically, we aim to identify which process parameters are most critical, the necessary data quantity and quality, how to structure data effectively, and what properties are required by the models. We find that nonlinear models perform best on the training and validation data sets, indicating nonlinear relationships to be learned, but linear models transfer better to the unseen test data, which comes later in time. The variable measuring the water temperature has a particularly detrimental effect on the models, owing to a significant change in distributions between training and test data. We therefore conclude that multiple years of data is necessary to learn robust machine learning models. By addressing foundational elements, particularly in the context of the climatic variability faced by northern regions, this work lays the groundwork for a more structured and tailored approach to machine learning for wastewater treatment. We share publicly both the data and code used to produce the results in the paper.

Via

Access Paper or Ask Questions

Recency-Weighted Temporally-Segmented Ensemble for Time-Series Modeling

Mar 04, 2024

Pål V. Johnsen, Eivind Bøhn, Sølve Eidnes, Filippo Remonato, Signe Riemer-Sørensen

Figure 1 for Recency-Weighted Temporally-Segmented Ensemble for Time-Series Modeling

Figure 2 for Recency-Weighted Temporally-Segmented Ensemble for Time-Series Modeling

Figure 3 for Recency-Weighted Temporally-Segmented Ensemble for Time-Series Modeling

Figure 4 for Recency-Weighted Temporally-Segmented Ensemble for Time-Series Modeling

Abstract:Time-series modeling in process industries faces the challenge of dealing with complex, multi-faceted, and evolving data characteristics. Conventional single model approaches often struggle to capture the interplay of diverse dynamics, resulting in suboptimal forecasts. Addressing this, we introduce the Recency-Weighted Temporally-Segmented (ReWTS, pronounced `roots') ensemble model, a novel chunk-based approach for multi-step forecasting. The key characteristics of the ReWTS model are twofold: 1) It facilitates specialization of models into different dynamics by segmenting the training data into `chunks' of data and training one model per chunk. 2) During inference, an optimization procedure assesses each model on the recent past and selects the active models, such that the appropriate mixture of previously learned dynamics can be recalled to forecast the future. This method not only captures the nuances of each period, but also adapts more effectively to changes over time compared to conventional `global' models trained on all data in one go. We present a comparative analysis, utilizing two years of data from a wastewater treatment plant and a drinking water treatment plant in Norway, demonstrating the ReWTS ensemble's superiority. It consistently outperforms the global model in terms of mean squared forecasting error across various model architectures by 10-70\% on both datasets, notably exhibiting greater resilience to outliers. This approach shows promise in developing automatic, adaptable forecasting models for decision-making and control systems in process industries and other complex systems.

* Main article with 23 pages including 12 figures and 4 tables. Supplementary File with 11 pages including 9 figures

Via

Access Paper or Ask Questions

Learning Dynamical Systems from Noisy Data with Inverse-Explicit Integrators

Jun 06, 2023

Håkon Noren, Sølve Eidnes, Elena Celledoni

Figure 1 for Learning Dynamical Systems from Noisy Data with Inverse-Explicit Integrators

Figure 2 for Learning Dynamical Systems from Noisy Data with Inverse-Explicit Integrators

Figure 3 for Learning Dynamical Systems from Noisy Data with Inverse-Explicit Integrators

Figure 4 for Learning Dynamical Systems from Noisy Data with Inverse-Explicit Integrators

Abstract:We introduce the mean inverse integrator (MII), a novel approach to increase the accuracy when training neural networks to approximate vector fields of dynamical systems from noisy data. This method can be used to average multiple trajectories obtained by numerical integrators such as Runge-Kutta methods. We show that the class of mono-implicit Runge-Kutta methods (MIRK) has particular advantages when used in connection with MII. When training vector field approximations, explicit expressions for the loss functions are obtained when inserting the training data in the MIRK formulae, unlocking symmetric and high-order integrators that would otherwise be implicit for initial value problems. The combined approach of applying MIRK within MII yields a significantly lower error compared to the plain use of the numerical integrator without averaging the trajectories. This is demonstrated with experiments using data from several (chaotic) Hamiltonian systems. Additionally, we perform a sensitivity analysis of the loss functions under normally distributed perturbations, supporting the favorable performance of MII.

* 23 pages, 10 figures

Via

Access Paper or Ask Questions

Pseudo-Hamiltonian system identification

May 09, 2023

Sigurd Holmsen, Sølve Eidnes, Signe Riemer-Sørensen

Figure 1 for Pseudo-Hamiltonian system identification

Figure 2 for Pseudo-Hamiltonian system identification

Figure 3 for Pseudo-Hamiltonian system identification

Figure 4 for Pseudo-Hamiltonian system identification

Abstract:Identifying the underlying dynamics of physical systems can be challenging when only provided with observational data. In this work, we consider systems that can be modelled as first-order ordinary differential equations. By assuming a certain pseudo-Hamiltonian formulation, we are able to learn the analytic terms of internal dynamics even if the model is trained on data where the system is affected by unknown damping and external disturbances. In cases where it is difficult to find analytic terms for the disturbances, a hybrid model that uses a neural network to learn these can still accurately identify the dynamics of the system as if under ideal conditions. This makes the models applicable in situations where other system identification models fail. Furthermore, we propose to use a fourth-order symmetric integration scheme in the loss function and avoid actual integration in the training, and demonstrate on varied examples how this leads to increased performance on noisy data.

* 32 pages, 21 figures, including the appendix

Via

Access Paper or Ask Questions

Pseudo-Hamiltonian neural networks for learning partial differential equations

Apr 27, 2023

Sølve Eidnes, Kjetil Olsen Lye

Abstract:Pseudo-Hamiltonian neural networks (PHNN) were recently introduced for learning dynamical systems that can be modelled by ordinary differential equations. In this paper, we extend the method to partial differential equations. The resulting model is comprised of up to three neural networks, modelling terms representing conservation, dissipation and external forces, and discrete convolution operators that can either be learned or be prior knowledge. We demonstrate numerically the superior performance of PHNN compared to a baseline model that models the full dynamics by a single neural network. Moreover, since the PHNN model consists of three parts with different physical interpretations, these can be studied separately to gain insight into the system, and the learned model is applicable also if external forces are removed or changed.

* 32 pages, 14 figures

Via

Access Paper or Ask Questions

Port-Hamiltonian Neural Networks with State Dependent Ports

Jun 06, 2022

Sølve Eidnes, Alexander J. Stasik, Camilla Sterud, Eivind Bøhn, Signe Riemer-Sørensen

Figure 1 for Port-Hamiltonian Neural Networks with State Dependent Ports

Figure 2 for Port-Hamiltonian Neural Networks with State Dependent Ports

Figure 3 for Port-Hamiltonian Neural Networks with State Dependent Ports

Figure 4 for Port-Hamiltonian Neural Networks with State Dependent Ports

Abstract:Hybrid machine learning based on Hamiltonian formulations has recently been successfully demonstrated for simple mechanical systems. In this work, we stress-test the method on both simple mass-spring systems and more complex and realistic systems with several internal and external forces, including a system with multiple connected tanks. We quantify performance under various conditions and show that imposing different assumptions greatly affect the performance during training presenting advantages and limitations of the method. We demonstrate that port-Hamiltonian neural networks can be extended to larger dimensions with state-dependent ports. We consider learning on systems with known and unknown external forces and show how it can be used to detect deviations in a system and still provide a valid model when the deviations are removed. Finally, we propose a symmetric high-order integrator for improved training on sparse and noisy data.

* 23 pages, 12 figures

Via

Access Paper or Ask Questions