Picture for Thinh T. Doan

Thinh T. Doan

Bayesian meta learning for trustworthy uncertainty quantification

Add code
Jul 27, 2024
Viaarxiv icon

Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning

Add code
May 15, 2024
Viaarxiv icon

Natural Policy Gradient and Actor Critic Methods for Constrained Multi-Task Reinforcement Learning

Add code
May 03, 2024
Viaarxiv icon

Fast Nonlinear Two-Time-Scale Stochastic Approximation: Achieving $O(1/k)$ Finite-Sample Complexity

Add code
Jan 24, 2024
Viaarxiv icon

Connected Superlevel Set in (Deep) Reinforcement Learning and its Application to Minimax Theorems

Add code
Mar 23, 2023
Viaarxiv icon

Convergence and Price of Anarchy Guarantees of the Softmax Policy Gradient in Markov Potential Games

Add code
Jun 15, 2022
Figure 1 for Convergence and Price of Anarchy Guarantees of the Softmax Policy Gradient in Markov Potential Games
Figure 2 for Convergence and Price of Anarchy Guarantees of the Softmax Policy Gradient in Markov Potential Games
Figure 3 for Convergence and Price of Anarchy Guarantees of the Softmax Policy Gradient in Markov Potential Games
Figure 4 for Convergence and Price of Anarchy Guarantees of the Softmax Policy Gradient in Markov Potential Games
Viaarxiv icon

Regularized Gradient Descent Ascent for Two-Player Zero-Sum Markov Games

Add code
May 27, 2022
Figure 1 for Regularized Gradient Descent Ascent for Two-Player Zero-Sum Markov Games
Figure 2 for Regularized Gradient Descent Ascent for Two-Player Zero-Sum Markov Games
Viaarxiv icon

Convergence Rates of Two-Time-Scale Gradient Descent-Ascent Dynamics for Solving Nonconvex Min-Max Problems

Add code
Dec 17, 2021
Figure 1 for Convergence Rates of Two-Time-Scale Gradient Descent-Ascent Dynamics for Solving Nonconvex Min-Max Problems
Viaarxiv icon

Finite-Time Complexity of Online Primal-Dual Natural Actor-Critic Algorithm for Constrained Markov Decision Processes

Add code
Oct 21, 2021
Figure 1 for Finite-Time Complexity of Online Primal-Dual Natural Actor-Critic Algorithm for Constrained Markov Decision Processes
Figure 2 for Finite-Time Complexity of Online Primal-Dual Natural Actor-Critic Algorithm for Constrained Markov Decision Processes
Viaarxiv icon

A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning

Add code
Oct 01, 2021
Figure 1 for A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning
Figure 2 for A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning
Viaarxiv icon