Picture for Runzhe Wu

Runzhe Wu

Diffusing States and Matching Scores: A New Framework for Imitation Learning

Add code
Oct 17, 2024
Viaarxiv icon

Computationally Efficient RL under Linear Bellman Completeness for Deterministic Dynamics

Add code
Jun 17, 2024
Figure 1 for Computationally Efficient RL under Linear Bellman Completeness for Deterministic Dynamics
Viaarxiv icon

Making RL with Preference-based Feedback Efficient via Randomization

Add code
Oct 23, 2023
Viaarxiv icon

Contextual Bandits and Imitation Learning via Preference-Based Active Queries

Add code
Jul 24, 2023
Viaarxiv icon

Selective Sampling and Imitation Learning via Online Regression

Add code
Jul 11, 2023
Viaarxiv icon

The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning

Add code
May 25, 2023
Figure 1 for The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Figure 2 for The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Viaarxiv icon

Distributional Offline Policy Evaluation with Predictive Error Guarantees

Add code
Feb 19, 2023
Figure 1 for Distributional Offline Policy Evaluation with Predictive Error Guarantees
Figure 2 for Distributional Offline Policy Evaluation with Predictive Error Guarantees
Figure 3 for Distributional Offline Policy Evaluation with Predictive Error Guarantees
Figure 4 for Distributional Offline Policy Evaluation with Predictive Error Guarantees
Viaarxiv icon

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning

Add code
Jun 05, 2021
Figure 1 for MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning
Figure 2 for MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning
Figure 3 for MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning
Figure 4 for MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning
Viaarxiv icon