Picture for Xiangyuan Zhang

Xiangyuan Zhang

Structure Matters: Dynamic Policy Gradient

Add code
Nov 07, 2024
Viaarxiv icon

Decision Transformer as a Foundation Model for Partially Observable Continuous Control

Add code
Apr 03, 2024
Viaarxiv icon

Policy Optimization for PDE Control with a Warm Start

Add code
Mar 01, 2024
Viaarxiv icon

Controlgym: Large-Scale Safety-Critical Control Environments for Benchmarking Reinforcement Learning Algorithms

Add code
Nov 30, 2023
Viaarxiv icon

Global Convergence of Receding-Horizon Policy Search in Learning Estimator Designs

Add code
Sep 09, 2023
Viaarxiv icon

Revisiting LQR Control from the Perspective of Receding-Horizon Policy Gradient

Add code
Feb 25, 2023
Viaarxiv icon

Learning the Kalman Filter with Fine-Grained Sample Complexity

Add code
Jan 30, 2023
Viaarxiv icon

Derivative-Free Policy Optimization for Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample Complexity

Add code
Jan 04, 2021
Figure 1 for Derivative-Free Policy Optimization for Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample Complexity
Figure 2 for Derivative-Free Policy Optimization for Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample Complexity
Figure 3 for Derivative-Free Policy Optimization for Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample Complexity
Figure 4 for Derivative-Free Policy Optimization for Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample Complexity
Viaarxiv icon

Non-Cooperative Inverse Reinforcement Learning

Add code
Nov 03, 2019
Figure 1 for Non-Cooperative Inverse Reinforcement Learning
Viaarxiv icon