Picture for Josiah Hanna

Josiah Hanna

Future Prediction Can be a Strong Evidence of Good History Representation in Partially Observable Environments

Add code
Feb 11, 2024
Viaarxiv icon

SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits

Add code
Jan 29, 2023
Viaarxiv icon

Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning

Add code
Jul 12, 2022
Figure 1 for Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning
Figure 2 for Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning
Figure 3 for Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning
Figure 4 for Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning
Viaarxiv icon

Multi-agent Databases via Independent Learning

Add code
May 28, 2022
Figure 1 for Multi-agent Databases via Independent Learning
Figure 2 for Multi-agent Databases via Independent Learning
Figure 3 for Multi-agent Databases via Independent Learning
Viaarxiv icon

Decoupling Exploration and Exploitation in Reinforcement Learning

Add code
Jul 22, 2021
Figure 1 for Decoupling Exploration and Exploitation in Reinforcement Learning
Figure 2 for Decoupling Exploration and Exploitation in Reinforcement Learning
Figure 3 for Decoupling Exploration and Exploitation in Reinforcement Learning
Figure 4 for Decoupling Exploration and Exploitation in Reinforcement Learning
Viaarxiv icon

Reducing Sampling Error in Batch Temporal Difference Learning

Add code
Aug 15, 2020
Viaarxiv icon

An Imitation from Observation Approach to Sim-to-Real Transfer

Add code
Aug 04, 2020
Figure 1 for An Imitation from Observation Approach to Sim-to-Real Transfer
Figure 2 for An Imitation from Observation Approach to Sim-to-Real Transfer
Figure 3 for An Imitation from Observation Approach to Sim-to-Real Transfer
Figure 4 for An Imitation from Observation Approach to Sim-to-Real Transfer
Viaarxiv icon

Learning an Interpretable Traffic Signal Control Policy

Add code
Dec 23, 2019
Figure 1 for Learning an Interpretable Traffic Signal Control Policy
Figure 2 for Learning an Interpretable Traffic Signal Control Policy
Figure 3 for Learning an Interpretable Traffic Signal Control Policy
Figure 4 for Learning an Interpretable Traffic Signal Control Policy
Viaarxiv icon

Importance Sampling Policy Evaluation with an Estimated Behavior Policy

Add code
Sep 24, 2018
Figure 1 for Importance Sampling Policy Evaluation with an Estimated Behavior Policy
Figure 2 for Importance Sampling Policy Evaluation with an Estimated Behavior Policy
Figure 3 for Importance Sampling Policy Evaluation with an Estimated Behavior Policy
Viaarxiv icon

Approximation of Lorenz-Optimal Solutions in Multiobjective Markov Decision Processes

Add code
Sep 26, 2013
Figure 1 for Approximation of Lorenz-Optimal Solutions in Multiobjective Markov Decision Processes
Figure 2 for Approximation of Lorenz-Optimal Solutions in Multiobjective Markov Decision Processes
Figure 3 for Approximation of Lorenz-Optimal Solutions in Multiobjective Markov Decision Processes
Figure 4 for Approximation of Lorenz-Optimal Solutions in Multiobjective Markov Decision Processes
Viaarxiv icon