Picture for Paulo Rauber

Paulo Rauber

Posterior Sampling for Deep Reinforcement Learning

Add code
Apr 30, 2023
Viaarxiv icon

Hardness in Markov Decision Processes: Theory and Practice

Add code
Oct 24, 2022
Viaarxiv icon

Recurrent Neural-Linear Posterior Sampling for Non-Stationary Contextual Bandits

Add code
Jul 09, 2020
Figure 1 for Recurrent Neural-Linear Posterior Sampling for Non-Stationary Contextual Bandits
Figure 2 for Recurrent Neural-Linear Posterior Sampling for Non-Stationary Contextual Bandits
Figure 3 for Recurrent Neural-Linear Posterior Sampling for Non-Stationary Contextual Bandits
Figure 4 for Recurrent Neural-Linear Posterior Sampling for Non-Stationary Contextual Bandits
Viaarxiv icon

Hindsight policy gradients

Add code
Feb 20, 2019
Figure 1 for Hindsight policy gradients
Figure 2 for Hindsight policy gradients
Figure 3 for Hindsight policy gradients
Figure 4 for Hindsight policy gradients
Viaarxiv icon