Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs

Feb 17, 2022

Sammie Katt, Hai Nguyen, Frans A. Oliehoek, Christopher Amato

Figure 1 for BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs

Figure 2 for BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs

Figure 3 for BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs

Figure 4 for BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs

Share this with someone who'll enjoy it:

Abstract:While reinforcement learning (RL) has made great advances in scalability, exploration and partial observability are still active research topics. In contrast, Bayesian RL (BRL) provides a principled answer to both state estimation and the exploration-exploitation trade-off, but struggles to scale. To tackle this challenge, BRL frameworks with various prior assumptions have been proposed, with varied success. This work presents a representation-agnostic formulation of BRL under partially observability, unifying the previous models under one theoretical umbrella. To demonstrate its practical significance we also propose a novel derivation, Bayes-Adaptive Deep Dropout rl (BADDr), based on dropout networks. Under this parameterization, in contrast to previous work, the belief over the state and dynamics is a more scalable inference problem. We choose actions through Monte-Carlo tree search and empirically show that our method is competitive with state-of-the-art BRL methods on small domains while being able to solve much larger ones.

View paper on

Share this with someone who'll enjoy it:

Title:BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs

Paper and Code