Picture for Zaiwei Chen

Zaiwei Chen

A Non-Asymptotic Theory of Seminorm Lyapunov Stability: From Deterministic to Stochastic Iterative Algorithms

Add code
Feb 20, 2025
Viaarxiv icon

Last-Iterate Convergence of Payoff-Based Independent Learning in Zero-Sum Stochastic Games

Add code
Sep 02, 2024
Viaarxiv icon

Approximate Global Convergence of Independent Learning in Multi-Agent Systems

Add code
May 30, 2024
Viaarxiv icon

Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games

Add code
Dec 08, 2023
Viaarxiv icon

Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise

Add code
Mar 28, 2023
Viaarxiv icon

Convergence Rates for Localized Actor-Critic in Networked Markov Potential Games

Add code
Mar 08, 2023
Viaarxiv icon

A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games

Add code
Mar 03, 2023
Viaarxiv icon

Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning

Add code
Nov 30, 2022
Viaarxiv icon

Sample Complexity of Policy-Based Methods under Off-Policy Sampling and Linear Function Approximation

Add code
Aug 05, 2022
Viaarxiv icon

Target Network and Truncation Overcome The Deadly triad in $Q$-Learning

Add code
Mar 05, 2022
Figure 1 for Target Network and Truncation Overcome The Deadly triad in $Q$-Learning
Figure 2 for Target Network and Truncation Overcome The Deadly triad in $Q$-Learning
Figure 3 for Target Network and Truncation Overcome The Deadly triad in $Q$-Learning
Figure 4 for Target Network and Truncation Overcome The Deadly triad in $Q$-Learning
Viaarxiv icon