Shangtong Zhang

CRASH: Challenging Reinforcement-Learning Based Adversarial Scenarios For Safety Hardening

Nov 26, 2024

Almost Sure Convergence Rates and Concentration of Stochastic Approximation and Reinforcement Learning with Markovian Noise

Nov 20, 2024

Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning

Oct 08, 2024

Doubly Optimal Policy Evaluation for Reinforcement Learning

Oct 03, 2024

Almost Sure Convergence of Average Reward Temporal Difference Learning

Sep 29, 2024

Almost Sure Convergence of Linear Temporal Difference Learning with Arbitrary Features

Sep 18, 2024

Efficient Multi-Policy Evaluation for Reinforcement Learning

Aug 16, 2024

Transformers Learn Temporal Difference Methods for In-Context Reinforcement Learning

May 22, 2024

The ODE Method for Stochastic Approximation and Reinforcement Learning with Markovian Noise

Feb 06, 2024

AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning

Aug 07, 2023