Title: An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning

May 10, 2020

Hirohisa Watanabe, Mineto Tsukada, Hiroki Matsutani

Abstract: DQN (Deep Q-Network) is a method for performing Q-learning in reinforcement learning using deep neural networks. DQNs require large buffers for experience replay and rely on backpropagation-based iterative optimization, which makes them difficult to implement on resource-limited edge devices. In this paper, we propose a lightweight on-device reinforcement learning approach for low-cost FPGA devices. It exploits a recently proposed neural-network-based on-device learning approach that does not rely on backpropagation but instead uses ELM (Extreme Learning Machine) and OS-ELM (Online Sequential ELM) based training algorithms. In addition, we propose a combination of L2 regularization and spectral normalization for on-device reinforcement learning, so that the output values of the neural network fit within a certain range and the reinforcement learning becomes stable. The proposed reinforcement learning approach is designed for the Xilinx PYNQ-Z1 board as a low-cost FPGA platform. Experimental results using OpenAI Gym demonstrate that the proposed algorithm and its FPGA implementation complete a CartPole-v0 task 29.76x and 126.06x faster, respectively, than a conventional DQN-based approach when the number of hidden-layer nodes is 64.
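To make the training scheme named in the abstract more concrete, the following is a minimal NumPy sketch of a generic OS-ELM with an L2-regularized initial solution and spectrally normalized input weights. It is not the authors' implementation: the class name `OSELM`, the ReLU activation, the uniform weight initialization, and the choice to apply spectral normalization to the fixed random input weights `alpha` are all assumptions made for illustration. Only the output weights `beta` are trained, using a recursive least-squares update rather than backpropagation.

```python
import numpy as np


class OSELM:
    """Illustrative OS-ELM sketch (hypothetical names, not the paper's code)."""

    def __init__(self, n_in, n_hidden, n_out, l2=1.0, seed=0):
        rng = np.random.default_rng(seed)
        alpha = rng.uniform(-1.0, 1.0, size=(n_in, n_hidden))
        # Spectral normalization (assumed placement): scale the fixed random
        # input weights so that their largest singular value equals 1.
        self.alpha = alpha / np.linalg.norm(alpha, 2)
        self.bias = rng.uniform(-1.0, 1.0, size=n_hidden)
        self.l2 = l2                      # L2 regularization strength
        self.beta = np.zeros((n_hidden, n_out))
        self.P = None                     # running inverse of (H^T H + l2 * I)

    def hidden(self, x):
        # ReLU is an assumption; ELM admits other activations as well.
        return np.maximum(0.0, x @ self.alpha + self.bias)

    def init_train(self, x0, t0):
        # Initial batch: ridge-regression solution for the output weights.
        h = self.hidden(x0)
        self.P = np.linalg.inv(h.T @ h + self.l2 * np.eye(h.shape[1]))
        self.beta = self.P @ h.T @ t0

    def seq_train(self, x, t):
        # Sequential update: recursive least squares on a new chunk of samples.
        h = self.hidden(x)
        k = np.linalg.inv(np.eye(h.shape[0]) + h @ self.P @ h.T)
        self.P = self.P - self.P @ h.T @ k @ h @ self.P
        self.beta = self.beta + self.P @ h.T @ (t - h @ self.beta)

    def predict(self, x):
        return self.hidden(x) @ self.beta
```

In a Q-learning setting, `x` would be a state (or state-action) vector and `t` the corresponding Q-value target; with one sample per `seq_train` call, the matrix inverse in the update is over a 1x1 matrix, which is one reason this style of update is attractive on small devices.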

* 14 pages, 6 figures
