Title: RLFlow: Optimising Neural Network Subgraph Transformation with World Models

May 09, 2022

Sean Parker, Sami Alabed, Eiko Yoneki

Abstract: Training deep learning models takes a long time and consumes large amounts of computing resources. At the same time, recent research has proposed systems and compilers intended to reduce the runtime of deep learning models. An effective optimisation methodology in data processing is desirable, and reducing the compute requirements of deep learning models is the focus of extensive research. In this paper, we address neural network sub-graph transformation by exploring reinforcement learning (RL) agents to achieve performance improvements. Our proposed approach, RLFlow, can learn to perform neural network sub-graph transformations and achieves a high level of performance without the need for expertly designed heuristics. Recent work has applied RL to computer systems with some success, especially using model-free RL techniques. Model-based RL methods have seen increased research focus because they can learn the transition dynamics of the environment; this can be leveraged to train an agent inside a hallucinated environment such as a World Model (WM), thereby increasing sample efficiency compared to model-free approaches. A WM uses variational auto-encoders to build a model of the system, which can then be explored inexpensively. In RLFlow, we propose a design for a model-based agent with a WM that learns to optimise the architecture of neural networks by performing a sequence of sub-graph transformations to reduce model runtime. We show that our approach can match state-of-the-art performance on common convolutional networks and outperform it by up to 5% on transformer-style architectures.
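To make the idea of "a sequence of sub-graph transformations to reduce model runtime" concrete, here is a minimal toy sketch in Python. It is not RLFlow's implementation. The operator names, the costs in `COST`, the rewrite rules in `RULES`, and the greedy policy are all assumptions invented for illustration. A linear chain of operators stands in for a real computation graph, and a greedy choice stands in for the learned agent and its World Model.

```python
from typing import List, Optional, Tuple

# Hypothetical per-operator runtime costs (arbitrary units, invented for this sketch).
COST = {"conv": 10, "relu": 2, "conv_relu": 11, "matmul": 8, "add": 1, "matmul_add": 8}

# Hypothetical rewrite rules: two adjacent operators are replaced by one fused operator.
RULES = [(("conv", "relu"), "conv_relu"), (("matmul", "add"), "matmul_add")]

Action = Tuple[int, int]  # (rule index, position in the chain)


def runtime(graph: List[str]) -> int:
    """Estimated runtime of the chain: the sum of operator costs."""
    return sum(COST[op] for op in graph)


def apply_rule(graph: List[str], rule_idx: int, pos: int) -> Optional[List[str]]:
    """Return the rewritten chain, or None if the rule does not match at pos."""
    pattern, fused = RULES[rule_idx]
    if tuple(graph[pos:pos + 2]) != pattern:
        return None
    return graph[:pos] + [fused] + graph[pos + 2:]


def valid_actions(graph: List[str]) -> List[Action]:
    """All (rule, position) pairs whose pattern matches the current chain."""
    return [(r, p)
            for r in range(len(RULES))
            for p in range(len(graph) - 1)
            if apply_rule(graph, r, p) is not None]


def step(graph: List[str], action: Action) -> Tuple[List[str], int]:
    """Environment step: apply a valid action; reward is the runtime reduction."""
    new_graph = apply_rule(graph, *action)
    assert new_graph is not None, "action must be valid"
    return new_graph, runtime(graph) - runtime(new_graph)


def greedy_rollout(graph: List[str]) -> Tuple[List[str], int]:
    """Stand-in policy: repeatedly take the action with the largest immediate reward."""
    total = 0
    while True:
        actions = valid_actions(graph)
        if not actions:
            return graph, total
        best = max(actions, key=lambda a: step(graph, a)[1])
        graph, reward = step(graph, best)
        total += reward


start = ["conv", "relu", "matmul", "add", "relu"]
final, total_reward = greedy_rollout(start)
print(runtime(start), runtime(final), final)
```

In this sketch the state is the current chain, an action picks a rule and a position, and the reward is the drop in estimated runtime. A learned agent would replace `greedy_rollout`, and a model-based approach would additionally learn to predict the outcome of `step` rather than evaluating it on the real system.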

* 14 pages, 11 figures
