Abstract: This paper provides a simulated laboratory for applying Reinforcement Learning (RL) to chemical discovery. Since RL is fairly data intensive, training agents "on the fly" by taking actions in the real world is infeasible and possibly dangerous. Moreover, chemical processing and discovery involve challenges that are not commonly found in RL benchmarks and therefore offer a rich space to work in. We introduce ChemGymRL, a set of highly customizable and open-source RL environments based on the standard OpenAI Gym template. ChemGymRL supports a series of interconnected virtual chemical benches where RL agents can operate and train. The paper introduces and details each of these benches, using well-known chemical reactions as illustrative examples, and trains a set of standard RL algorithms on each of them. Finally, we discuss and compare the performance of several standard RL methods and provide a list of directions for future work as a vision for the further development and use of ChemGymRL.
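Because ChemGymRL follows the standard OpenAI Gym template, agents interact with its benches through the usual reset/step loop. The sketch below shows that generic loop with a random agent; the environment ID 'WurtzReact-v1' is a hypothetical placeholder, so consult the ChemGymRL documentation for the IDs it actually registers.

```python
# Minimal sketch of the standard Gym interaction loop that a ChemGymRL-style
# bench would follow. The environment ID below is illustrative, not confirmed.
import gym

env = gym.make('WurtzReact-v1')  # hypothetical bench ID

obs = env.reset()
done = False
total_reward = 0.0
while not done:
    action = env.action_space.sample()  # random agent; replace with a trained policy
    obs, reward, done, info = env.step(action)
    total_reward += reward

print('episode return:', total_reward)
env.close()
```

Any RL library that speaks the Gym API (e.g., Stable-Baselines3) can be dropped in place of the random policy without changing the environment side of this loop.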
Abstract: Using a toy nautical navigation environment, we show that dynamic programming can be used when only partial information about a partially observed Markov decision process (POMDP) is known. By incorporating uncertainty into our model, we show that navigation policies can be constructed that maintain safety. By adding controlled sensing methods, we show that these policies can simultaneously lower measurement costs.
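To make the dynamic-programming step concrete, the following is a minimal sketch of tabular value iteration on a toy stochastic navigation model; it is not the paper's belief-state or controlled-sensing machinery, and the unsafe-state penalty is an assumption standing in for the safety constraint.

```python
# Minimal sketch of tabular value iteration on a toy stochastic navigation
# model. The POMDP belief-state and sensing-cost machinery of the paper is
# not reproduced here; safety is approximated by penalizing one state.
import numpy as np

n_states, n_actions, gamma = 5, 2, 0.95
rng = np.random.default_rng(0)

# P[a, s, t]: probability of moving from state s to state t under action a.
# R[s, a]: expected immediate reward for taking action a in state s.
P = rng.dirichlet(np.ones(n_states), size=(n_actions, n_states))
R = rng.normal(size=(n_states, n_actions))
R[-1, :] -= 10.0  # treat the last state as unsafe: heavy penalty

V = np.zeros(n_states)
for _ in range(500):  # Bellman optimality updates to (approximate) convergence
    Q = R + gamma * np.einsum('ast,t->sa', P, V)
    V_new = Q.max(axis=1)
    if np.max(np.abs(V_new - V)) < 1e-8:
        break
    V = V_new

policy = Q.argmax(axis=1)
print('optimal values:', V)
print('greedy policy:', policy)
```

Extending this to the partially observed setting means running the same backup over belief states (distributions over the hidden state) rather than states, which is where the paper's uncertainty modeling enters.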
Abstract: Using a model heat engine, we show that neural-network-based reinforcement learning can identify thermodynamic trajectories of maximal efficiency. We use an evolutionary learning algorithm to evolve a population of neural networks, subject to a directive to maximize the efficiency of a trajectory composed of a set of elementary thermodynamic processes; the resulting networks learn to carry out the maximally efficient Carnot, Stirling, or Otto cycles. Given additional irreversible processes, this evolutionary scheme learns a hitherto unknown thermodynamic cycle. Our results show how reinforcement learning strategies developed for game playing can be applied to solve physical problems conditioned upon path-extensive order parameters.
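The population-based evolutionary loop the abstract describes can be sketched as follows. The fitness function here is a toy stand-in (it is not the engine-efficiency objective of the paper), and the parameter vector is treated as a flattened set of network weights; population size, mutation scale, and elite fraction are all assumed values.

```python
# Minimal sketch of an evolutionary learning loop: evaluate a population,
# keep the fittest individuals, and refill the population with mutated
# copies. The fitness below is a placeholder, not the paper's objective.
import numpy as np

rng = np.random.default_rng(1)
pop_size, n_params, n_generations, sigma = 50, 20, 100, 0.1

def fitness(theta):
    # Placeholder objective; in the paper this would be the efficiency of
    # the thermodynamic trajectory produced by a network with weights theta.
    return -np.sum(theta ** 2)

population = rng.normal(size=(pop_size, n_params))
for gen in range(n_generations):
    scores = np.array([fitness(ind) for ind in population])
    elite = population[np.argsort(scores)[-pop_size // 5:]]  # keep top 20%
    # Rebuild the population from mutated copies of the elite.
    parents = elite[rng.integers(len(elite), size=pop_size)]
    population = parents + sigma * rng.normal(size=(pop_size, n_params))

best = population[np.argmax([fitness(ind) for ind in population])]
print('best fitness:', fitness(best))
```

Because selection needs only a scalar score per trajectory, this scheme suits path-extensive objectives such as cycle efficiency, where no per-step reward signal is available for gradient-based RL.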