Picture for Han-Dong Lim

Han-Dong Lim

A finite time analysis of distributed Q-learning

Add code
May 23, 2024
Viaarxiv icon

Finite-Time Error Analysis of Online Model-Based Q-Learning with a Relaxed Sampling Model

Add code
Feb 19, 2024
Viaarxiv icon

A primal-dual perspective for distributed TD-learning

Add code
Oct 01, 2023
Viaarxiv icon

An O.D.E. Framework of Distributed TD-Learning for Networked Multi-Agent Markov Decision Processes

Add code
Aug 17, 2023
Viaarxiv icon

Temporal Difference Learning with Experience Replay

Add code
Jun 16, 2023
Viaarxiv icon

Backstepping Temporal Difference Learning

Add code
Feb 28, 2023
Viaarxiv icon

Finite-Time Analysis of Asynchronous Q-learning under Diminishing Step-Size from Control-Theoretic View

Add code
Jul 25, 2022
Figure 1 for Finite-Time Analysis of Asynchronous Q-learning under Diminishing Step-Size from Control-Theoretic View
Figure 2 for Finite-Time Analysis of Asynchronous Q-learning under Diminishing Step-Size from Control-Theoretic View
Viaarxiv icon

Regularized Q-learning

Add code
Mar 01, 2022
Figure 1 for Regularized Q-learning
Figure 2 for Regularized Q-learning
Figure 3 for Regularized Q-learning
Figure 4 for Regularized Q-learning
Viaarxiv icon

Versions of Gradient Temporal Difference Learning

Add code
Sep 09, 2021
Figure 1 for Versions of Gradient Temporal Difference Learning
Figure 2 for Versions of Gradient Temporal Difference Learning
Figure 3 for Versions of Gradient Temporal Difference Learning
Viaarxiv icon