Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Paul Zimmerman

Adaptive Learning for Discovery

Jun 03, 2022

Ziping Xu, Eunjae Shim, Ambuj Tewari, Paul Zimmerman

Figure 1 for Adaptive Learning for Discovery

Figure 2 for Adaptive Learning for Discovery

Figure 3 for Adaptive Learning for Discovery

Figure 4 for Adaptive Learning for Discovery

Abstract:In this paper, we study a sequential decision-making problem, called Adaptive Sampling for Discovery (ASD). Starting with a large unlabeled dataset, algorithms for ASD adaptively label the points with the goal to maximize the sum of responses. This problem has wide applications to real-world discovery problems, for example drug discovery with the help of machine learning models. ASD algorithms face the well-known exploration-exploitation dilemma. The algorithm needs to choose points that yield information to improve model estimates but it also needs to exploit the model. We rigorously formulate the problem and propose a general information-directed sampling (IDS) algorithm. We provide theoretical guarantees for the performance of IDS in linear, graph and low-rank models. The benefits of IDS are shown in both simulation experiments and real-data experiments for discovering chemical reaction conditions.

Via

Access Paper or Ask Questions

TorsionNet: A Reinforcement Learning Approach to Sequential Conformer Search

Jun 12, 2020

Tarun Gogineni, Ziping Xu, Exequiel Punzalan, Runxuan Jiang, Joshua Kammeraad, Ambuj Tewari, Paul Zimmerman

Figure 1 for TorsionNet: A Reinforcement Learning Approach to Sequential Conformer Search

Figure 2 for TorsionNet: A Reinforcement Learning Approach to Sequential Conformer Search

Figure 3 for TorsionNet: A Reinforcement Learning Approach to Sequential Conformer Search

Figure 4 for TorsionNet: A Reinforcement Learning Approach to Sequential Conformer Search

Abstract:Molecular geometry prediction of flexible molecules, or conformer search, is a long-standing challenge in computational chemistry. This task is of great importance for predicting structure-activity relationships for a wide variety of substances ranging from biomolecules to ubiquitous materials. Substantial computational resources are invested in Monte Carlo and Molecular Dynamics methods to generate diverse and representative conformer sets for medium to large molecules, which are yet intractable to chemoinformatic conformer search methods. We present TorsionNet, an efficient sequential conformer search technique based on reinforcement learning under the rigid rotor approximation. The model is trained via curriculum learning, whose theoretical benefit is explored in detail, to maximize a novel metric grounded in thermodynamics called the Gibbs Score. Our experimental results show that TorsionNet outperforms the highest scoring chemoinformatics method by 4x on large branched alkanes, and by several orders of magnitude on the previously unexplored biopolymer lignin, with applications in renewable energy.

Via

Access Paper or Ask Questions