Han-Dong Lim

Understanding the theoretical properties of projected Bellman equation, linear Q-learning, and approximate value iteration

Apr 15, 2025

Analysis of Off-Policy $n$-Step TD-Learning with Linear Function Approximation

Feb 13, 2025

A finite time analysis of distributed Q-learning

May 23, 2024

Finite-Time Error Analysis of Online Model-Based Q-Learning with a Relaxed Sampling Model

Feb 19, 2024

A primal-dual perspective for distributed TD-learning

Oct 01, 2023

An O.D.E. Framework of Distributed TD-Learning for Networked Multi-Agent Markov Decision Processes

Aug 17, 2023

Temporal Difference Learning with Experience Replay

Jun 16, 2023

Backstepping Temporal Difference Learning

Feb 28, 2023

Finite-Time Analysis of Asynchronous Q-learning under Diminishing Step-Size from Control-Theoretic View

Jul 25, 2022

Regularized Q-learning

Mar 01, 2022