Abstract: This paper introduces the Agtrup (BlueKolding) dataset, collected from the Agtrup wastewater treatment plant in Denmark, a plant designed for enhanced phosphorus removal via chemical and biological methods. The dataset is assembled from a high-frequency Supervisory Control and Data Acquisition (SCADA) system, which captures a wide range of variables related to the operational dynamics of nutrient removal. It comprises time-series measurements sampled at two-minute intervals across control, process, and environmental variables. The dataset aims to advance wastewater management by supporting the development of predictive models and the optimization of operational strategies. By providing detailed insight into the interactions and efficiencies of chemical and biological phosphorus removal processes, it serves as a resource for environmental researchers and engineers working to improve the sustainability and effectiveness of wastewater treatment operations. The ultimate goal of the dataset is to facilitate the creation of digital twins and the application of machine learning techniques, such as deep reinforcement learning, to predict and enhance system performance under varying operational conditions.
Abstract: Although Deep Reinforcement Learning (DRL) has shown outstanding results in robotics and games, it remains challenging to apply it to the optimization of industrial processes such as wastewater treatment. One of the challenges is the lack of a simulation environment that represents the actual plant accurately enough to train DRL policies. The stochasticity and non-linearity of wastewater treatment data lead to unstable and incorrect model predictions over long time horizons. One possible reason for this incorrect simulation behavior is compounding error, the accumulation of errors throughout the simulation. Compounding error occurs because the model uses its own predictions as inputs at each time step, so the error between the actual data and the prediction accumulates as the simulation continues. We implemented two methods to improve models trained on wastewater treatment data, which resulted in more accurate simulators: (1) using the model's own predictions as inputs during training as a means of correction, and (2) changing the loss function to account for the long-term predicted shape (dynamics). The experimental results showed that these methods can improve simulator behavior, measured by Dynamic Time Warping over a one-year horizon, by up to 98% compared to the base model. These improvements show significant promise for creating simulators of biological processes that do not need pre-existing knowledge of the process but instead depend exclusively on time-series data obtained from the system.
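To illustrate the compounding error and the Dynamic Time Warping measure mentioned in this abstract, the following is a minimal sketch on a toy system, not the authors' models or data. The toy dynamics, the deliberately mis-specified `toy_model`, and all function names are assumptions made for illustration only.

```python
def one_step_errors(model, series):
    """Absolute error when the model always sees the true previous value."""
    return [abs(model(series[t]) - series[t + 1]) for t in range(len(series) - 1)]


def rollout_errors(model, series):
    """Absolute error when the model is fed its own previous prediction."""
    errors = []
    state = series[0]  # only the first input comes from real data
    for t in range(len(series) - 1):
        state = model(state)  # prediction becomes the next input
        errors.append(abs(state - series[t + 1]))
    return errors


def dtw_distance(a, b):
    """Classic dynamic time warping with an absolute-difference local cost."""
    inf = float("inf")
    n, m = len(a), len(b)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


# Hypothetical "true" process: x[t+1] = 0.9 * x[t] + 1.0
true_series = [5.0]
for _ in range(50):
    true_series.append(0.9 * true_series[-1] + 1.0)


def toy_model(x):
    """Hypothetical learned model with a slightly wrong gain (0.95, not 0.9)."""
    return 0.95 * x + 1.0


step_err = one_step_errors(toy_model, true_series)
roll_err = rollout_errors(toy_model, true_series)

# Simulated trajectory produced by feeding predictions back in.
simulated = [true_series[0]]
for _ in range(50):
    simulated.append(toy_model(simulated[-1]))

print("last one-step error:", round(step_err[-1], 3))
print("last rollout error: ", round(roll_err[-1], 3))
print("DTW(true, simulated):", round(dtw_distance(true_series, simulated), 3))
```

In this toy case the one-step error stays small at every step, while the rollout error keeps growing because each prediction inherits the error of the previous one. This is the behavior the abstract describes as compounding error.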
Abstract: Phosphorus removal is vital in wastewater treatment to reduce reliance on limited resources. Deep reinforcement learning (DRL) is a machine learning technique that can optimize complex, nonlinear systems, including the processes in wastewater treatment plants, by learning control policies through trial and error. However, applying DRL to chemical and biological processes is challenging because it requires accurate simulators. This study trained six models to identify the phosphorus removal process and used them to create a simulator for the DRL environment. Although the models achieved high accuracy (>97%), uncertainty and incorrect prediction behavior limited their performance as simulators over longer horizons. Compounding errors in the models' predictions were identified as one of the causes of this problem. This approach to improving process control creates simulation environments for DRL algorithms from supervisory control and data acquisition (SCADA) data with a sufficient historical horizon, without complex system modeling or parameter estimation.
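As a rough sketch of how a learned one-step model might be wrapped as a simulator for a DRL agent, consider the minimal environment below. It is not the study's implementation: the class `SimulatedPlantEnv`, the stand-in `toy_phosphate_model`, the reward shape, and every numeric constant are hypothetical and chosen only to show the reset/step pattern.

```python
class SimulatedPlantEnv:
    """Hypothetical environment that steps a learned model instead of a real plant."""

    def __init__(self, model, initial_state, target, horizon=100, dose_cost=0.1):
        self.model = model              # callable: (state, action) -> next state
        self.initial_state = initial_state
        self.target = target            # assumed desired effluent phosphate level
        self.horizon = horizon          # episode length in time steps
        self.dose_cost = dose_cost      # assumed penalty per unit of chemical dose
        self.state = initial_state
        self.t = 0

    def reset(self):
        """Start a new episode and return the initial observation."""
        self.state = self.initial_state
        self.t = 0
        return self.state

    def step(self, action):
        """Advance one step; the model's prediction becomes the next state."""
        self.state = self.model(self.state, action)
        self.t += 1
        reward = -abs(self.state - self.target) - self.dose_cost * action
        done = self.t >= self.horizon
        return self.state, reward, done


def toy_phosphate_model(state, action):
    """Stand-in for a trained model: dosing (action) lowers the concentration."""
    return max(0.0, 0.9 * state + 0.2 - 0.3 * action)


env = SimulatedPlantEnv(toy_phosphate_model, initial_state=1.0, target=0.5, horizon=3)
obs = env.reset()
total_reward = 0.0
done = False
while not done:
    action = 1.0 if obs > 0.5 else 0.0  # naive threshold policy, for illustration
    obs, reward, done = env.step(action)
    total_reward += reward
print("final state:", obs, "total reward:", total_reward)
```

Because each `step` feeds the model's own output back as the next state, any prediction error in the model propagates through the episode, which is why simulator quality over long horizons matters for policy training.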