Scott M. Jordan

Position: Benchmarking is Limited in Reinforcement Learning Research

Jun 23, 2024

A New View on Planning in Online Reinforcement Learning

Jun 03, 2024

From Past to Future: Rethinking Eligibility Traces

Dec 20, 2023

Behavior Alignment via Reward Function Optimization

Oct 31, 2023

Coagent Networks: Generalized and Scaled

May 16, 2023

Avoiding Model Estimation in Robust Markov Decision Processes with a Generative Model

Feb 02, 2023

Towards Safe Policy Improvement for Non-Stationary MDPs

Oct 23, 2020

Evaluating the Performance of Reinforcement Learning Algorithms

Jun 30, 2020

Classical Policy Gradient: Preserving Bellman's Principle of Optimality

Jun 06, 2019