Picture for Huizhen Yu

Huizhen Yu

Asynchronous Stochastic Approximation and Average-Reward Reinforcement Learning

Add code
Sep 05, 2024
Viaarxiv icon

On Convergence of Average-Reward Q-Learning in Weakly Communicating Markov Decision Processes

Add code
Aug 29, 2024
Viaarxiv icon

A Note on Stability in Asynchronous Stochastic Approximation without Communication Delays

Add code
Dec 22, 2023
Viaarxiv icon

Two geometric input transformation methods for fast online reinforcement learning with neural nets

Add code
Sep 06, 2018
Figure 1 for Two geometric input transformation methods for fast online reinforcement learning with neural nets
Figure 2 for Two geometric input transformation methods for fast online reinforcement learning with neural nets
Figure 3 for Two geometric input transformation methods for fast online reinforcement learning with neural nets
Figure 4 for Two geometric input transformation methods for fast online reinforcement learning with neural nets
Viaarxiv icon

On Convergence of some Gradient-based Temporal-Differences Algorithms for Off-Policy Learning

Add code
Mar 28, 2018
Viaarxiv icon

On Convergence of Emphatic Temporal-Difference Learning

Add code
Dec 28, 2017
Figure 1 for On Convergence of Emphatic Temporal-Difference Learning
Viaarxiv icon

Multi-step Off-policy Learning Without Importance Sampling Ratios

Add code
Feb 09, 2017
Figure 1 for Multi-step Off-policy Learning Without Importance Sampling Ratios
Figure 2 for Multi-step Off-policy Learning Without Importance Sampling Ratios
Figure 3 for Multi-step Off-policy Learning Without Importance Sampling Ratios
Figure 4 for Multi-step Off-policy Learning Without Importance Sampling Ratios
Viaarxiv icon

Weak Convergence Properties of Constrained Emphatic Temporal-difference Learning with Constant and Slowly Diminishing Stepsize

Add code
Jan 20, 2017
Viaarxiv icon

Some Simulation Results for Emphatic Temporal-Difference Learning Algorithms

Add code
May 06, 2016
Figure 1 for Some Simulation Results for Emphatic Temporal-Difference Learning Algorithms
Figure 2 for Some Simulation Results for Emphatic Temporal-Difference Learning Algorithms
Figure 3 for Some Simulation Results for Emphatic Temporal-Difference Learning Algorithms
Figure 4 for Some Simulation Results for Emphatic Temporal-Difference Learning Algorithms
Viaarxiv icon

Emphatic Temporal-Difference Learning

Add code
Jul 06, 2015
Figure 1 for Emphatic Temporal-Difference Learning
Figure 2 for Emphatic Temporal-Difference Learning
Viaarxiv icon