Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Nikhil Vemgal

DGFN: Double Generative Flow Networks

Nov 06, 2023

Elaine Lau, Nikhil Vemgal, Doina Precup, Emmanuel Bengio

Figure 1 for DGFN: Double Generative Flow Networks

Figure 2 for DGFN: Double Generative Flow Networks

Figure 3 for DGFN: Double Generative Flow Networks

Figure 4 for DGFN: Double Generative Flow Networks

Abstract:Deep learning is emerging as an effective tool in drug discovery, with potential applications in both predictive and generative models. Generative Flow Networks (GFlowNets/GFNs) are a recently introduced method recognized for the ability to generate diverse candidates, in particular in small molecule generation tasks. In this work, we introduce double GFlowNets (DGFNs). Drawing inspiration from reinforcement learning and Double Deep Q-Learning, we introduce a target network used to sample trajectories, while updating the main network with these sampled trajectories. Empirical results confirm that DGFNs effectively enhance exploration in sparse reward domains and high-dimensional state spaces, both challenging aspects of de-novo design in drug discovery.

* Accepted to NeurIPS 2023 Workshop

Via

Access Paper or Ask Questions

An Empirical Study of the Effectiveness of Using a Replay Buffer on Mode Discovery in GFlowNets

Jul 18, 2023

Nikhil Vemgal, Elaine Lau, Doina Precup

Figure 1 for An Empirical Study of the Effectiveness of Using a Replay Buffer on Mode Discovery in GFlowNets

Figure 2 for An Empirical Study of the Effectiveness of Using a Replay Buffer on Mode Discovery in GFlowNets

Figure 3 for An Empirical Study of the Effectiveness of Using a Replay Buffer on Mode Discovery in GFlowNets

Figure 4 for An Empirical Study of the Effectiveness of Using a Replay Buffer on Mode Discovery in GFlowNets

Abstract:Reinforcement Learning (RL) algorithms aim to learn an optimal policy by iteratively sampling actions to learn how to maximize the total expected return, $R(x)$. GFlowNets are a special class of algorithms designed to generate diverse candidates, $x$, from a discrete set, by learning a policy that approximates the proportional sampling of $R(x)$. GFlowNets exhibit improved mode discovery compared to conventional RL algorithms, which is very useful for applications such as drug discovery and combinatorial search. However, since GFlowNets are a relatively recent class of algorithms, many techniques which are useful in RL have not yet been associated with them. In this paper, we study the utilization of a replay buffer for GFlowNets. We explore empirically various replay buffer sampling techniques and assess the impact on the speed of mode discovery and the quality of the modes discovered. Our experimental results in the Hypergrid toy domain and a molecule synthesis environment demonstrate significant improvements in mode discovery when training with a replay buffer, compared to training only with trajectories generated on-policy.

* Accepted to ICML 2023 workshop on Structured Probabilistic Inference & Generative Modeling

Via

Access Paper or Ask Questions