Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sven Mika

Distributed Reinforcement Learning is a Dataflow Problem

Dec 03, 2020

Eric Liang, Zhanghao Wu, Michael Luo, Sven Mika, Ion Stoica

Figure 1 for Distributed Reinforcement Learning is a Dataflow Problem

Figure 2 for Distributed Reinforcement Learning is a Dataflow Problem

Figure 3 for Distributed Reinforcement Learning is a Dataflow Problem

Figure 4 for Distributed Reinforcement Learning is a Dataflow Problem

Abstract:Researchers and practitioners in the field of reinforcement learning (RL) frequently leverage parallel computation, which has led to a plethora of new algorithms and systems in the last few years. In this paper, we re-examine the challenges posed by distributed RL and try to view it through the lens of an old idea: distributed dataflow. We show that viewing RL as a dataflow problem leads to highly composable and performant implementations. We propose AnonFlow, a hybrid actor-dataflow programming model for distributed RL, and validate its practicality by porting the full suite of algorithms in AnonLib, a widely-adopted distributed RL library.

* This paper has been withdrawn by the author due to the need to compare sample throughput and training times to more dataflow-based frameworks

Via

Access Paper or Ask Questions

RLgraph: Flexible Computation Graphs for Deep Reinforcement Learning

Oct 21, 2018

Michael Schaarschmidt, Sven Mika, Kai Fricke, Eiko Yoneki

Figure 1 for RLgraph: Flexible Computation Graphs for Deep Reinforcement Learning

Figure 2 for RLgraph: Flexible Computation Graphs for Deep Reinforcement Learning

Figure 3 for RLgraph: Flexible Computation Graphs for Deep Reinforcement Learning

Figure 4 for RLgraph: Flexible Computation Graphs for Deep Reinforcement Learning

Abstract:Reinforcement learning (RL) tasks are challenging to implement, execute and test due to algorithmic instability, hyper-parameter sensitivity, and heterogeneous distributed communication patterns. We argue for the separation of logical component composition, backend graph definition, and distributed execution. To this end, we introduce RLgraph, a library for designing and executing high performance RL computation graphs in both static graph and define-by-run paradigms. The resulting implementations yield high performance across different deep learning frameworks and distributed backends.

Via

Access Paper or Ask Questions