Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Green Simulation Assisted Policy Gradient to Accelerate Stochastic Process Control

Oct 17, 2021

Hua Zheng, Wei Xie, M. Ben Feng

Figure 1 for Green Simulation Assisted Policy Gradient to Accelerate Stochastic Process Control

Figure 2 for Green Simulation Assisted Policy Gradient to Accelerate Stochastic Process Control

Figure 3 for Green Simulation Assisted Policy Gradient to Accelerate Stochastic Process Control

Figure 4 for Green Simulation Assisted Policy Gradient to Accelerate Stochastic Process Control

Share this with someone who'll enjoy it:

Abstract:This study is motivated by the critical challenges in the biopharmaceutical manufacturing, including high complexity, high uncertainty, and very limited process data. Each experiment run is often very expensive. To support the optimal and robust process control, we propose a general green simulation assisted policy gradient (GS-PG) framework for both online and offline learning settings. Basically, to address the key limitations of state-of-art reinforcement learning (RL), such as sample inefficiency and low reliability, we create a mixture likelihood ratio based policy gradient estimation that can leverage on the information from historical experiments conducted under different inputs, including process model coefficients and decision policy parameters. Then, to accelerate the learning of optimal and robust policy, we further propose a variance reduction based sample selection method that allows GS-PG to intelligently select and reuse most relevant historical trajectories. The selection rule automatically updates the samples to be reused during the learning of process mechanisms and the search for optimal policy. Our theoretical and empirical studies demonstrate that the proposed framework can perform better than the state-of-art policy gradient approach and accelerate the optimal robust process control for complex stochastic systems under high uncertainty.

* 36 pages, 7 figures

View paper on

Share this with someone who'll enjoy it:

Title:Green Simulation Assisted Policy Gradient to Accelerate Stochastic Process Control

Paper and Code