Josiah P. Hanna

Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer

Dec 12, 2024
Via arXiv

Stable Offline Value Function Learning with Bisimulation-based Representations

Oct 02, 2024
Via arXiv

Reinforcement Learning via Auxiliary Task Distillation

Jun 24, 2024
Via arXiv

Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning

Jun 07, 2024
Via arXiv

SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP

Jun 04, 2024
Via arXiv

Adaptive Exploration for Data-Efficient General Value Function Evaluations

May 13, 2024
Via arXiv

On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling

Nov 14, 2023
Via arXiv

Multi-task Representation Learning for Pure Exploration in Bilinear Bandits

Nov 01, 2023
Via arXiv

State-Action Similarity-Based Representations for Off-Policy Evaluation

Oct 27, 2023
Via arXiv

Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning

Oct 27, 2023
Via arXiv