Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Anna Levit

Free energy-based reinforcement learning using a quantum processor

May 29, 2017

Anna Levit, Daniel Crawford, Navid Ghadermarzy, Jaspreet S. Oberoi, Ehsan Zahedinejad, Pooya Ronagh

Figure 1 for Free energy-based reinforcement learning using a quantum processor

Figure 2 for Free energy-based reinforcement learning using a quantum processor

Figure 3 for Free energy-based reinforcement learning using a quantum processor

Figure 4 for Free energy-based reinforcement learning using a quantum processor

Abstract:Recent theoretical and experimental results suggest the possibility of using current and near-future quantum hardware in challenging sampling tasks. In this paper, we introduce free energy-based reinforcement learning (FERL) as an application of quantum hardware. We propose a method for processing a quantum annealer's measured qubit spin configurations in approximating the free energy of a quantum Boltzmann machine (QBM). We then apply this method to perform reinforcement learning on the grid-world problem using the D-Wave 2000Q quantum annealer. The experimental results show that our technique is a promising method for harnessing the power of quantum sampling in reinforcement learning tasks.

Via

Access Paper or Ask Questions

Reinforcement Learning Using Quantum Boltzmann Machines

Dec 25, 2016

Daniel Crawford, Anna Levit, Navid Ghadermarzy, Jaspreet S. Oberoi, Pooya Ronagh

Figure 1 for Reinforcement Learning Using Quantum Boltzmann Machines

Figure 2 for Reinforcement Learning Using Quantum Boltzmann Machines

Figure 3 for Reinforcement Learning Using Quantum Boltzmann Machines

Figure 4 for Reinforcement Learning Using Quantum Boltzmann Machines

Abstract:We investigate whether quantum annealers with select chip layouts can outperform classical computers in reinforcement learning tasks. We associate a transverse field Ising spin Hamiltonian with a layout of qubits similar to that of a deep Boltzmann machine (DBM) and use simulated quantum annealing (SQA) to numerically simulate quantum sampling from this system. We design a reinforcement learning algorithm in which the set of visible nodes representing the states and actions of an optimal policy are the first and last layers of the deep network. In absence of a transverse field, our simulations show that DBMs train more effectively than restricted Boltzmann machines (RBM) with the same number of weights. Since sampling from Boltzmann distributions of a DBM is not classically feasible, this is evidence of advantage of a non-Turing sampling oracle. We then develop a framework for training the network as a quantum Boltzmann machine (QBM) in the presence of a significant transverse field for reinforcement learning. This further improves the reinforcement learning method using DBMs.

Via

Access Paper or Ask Questions