Picture for Xuezhou Zhang

Xuezhou Zhang

Accelerating RL for LLM Reasoning with Optimal Advantage Regression

Add code
May 27, 2025
Viaarxiv icon

State-free Reinforcement Learning

Add code
Sep 27, 2024
Figure 1 for State-free Reinforcement Learning
Viaarxiv icon

Scale-free Adversarial Reinforcement Learning

Add code
Mar 01, 2024
Viaarxiv icon

Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback

Add code
Nov 14, 2023
Figure 1 for Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback
Figure 2 for Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback
Viaarxiv icon

Federated Multi-Level Optimization over Decentralized Networks

Add code
Oct 10, 2023
Figure 1 for Federated Multi-Level Optimization over Decentralized Networks
Figure 2 for Federated Multi-Level Optimization over Decentralized Networks
Figure 3 for Federated Multi-Level Optimization over Decentralized Networks
Figure 4 for Federated Multi-Level Optimization over Decentralized Networks
Viaarxiv icon

Improved Algorithms for Adversarial Bandits with Unbounded Losses

Add code
Oct 03, 2023
Figure 1 for Improved Algorithms for Adversarial Bandits with Unbounded Losses
Figure 2 for Improved Algorithms for Adversarial Bandits with Unbounded Losses
Figure 3 for Improved Algorithms for Adversarial Bandits with Unbounded Losses
Viaarxiv icon

Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP

Add code
Jun 21, 2023
Viaarxiv icon

Provable Defense against Backdoor Policies in Reinforcement Learning

Add code
Nov 18, 2022
Figure 1 for Provable Defense against Backdoor Policies in Reinforcement Learning
Figure 2 for Provable Defense against Backdoor Policies in Reinforcement Learning
Figure 3 for Provable Defense against Backdoor Policies in Reinforcement Learning
Figure 4 for Provable Defense against Backdoor Policies in Reinforcement Learning
Viaarxiv icon

Representation Learning for General-sum Low-rank Markov Games

Add code
Oct 30, 2022
Figure 1 for Representation Learning for General-sum Low-rank Markov Games
Figure 2 for Representation Learning for General-sum Low-rank Markov Games
Figure 3 for Representation Learning for General-sum Low-rank Markov Games
Figure 4 for Representation Learning for General-sum Low-rank Markov Games
Viaarxiv icon

Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization

Add code
Jun 29, 2022
Figure 1 for Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization
Figure 2 for Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization
Figure 3 for Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization
Viaarxiv icon