Vektor Dewanto

Approximate discounting-free policy evaluation from transient and recurrent states

Apr 08, 2022
Via arXiv

Examining average and discounted reward optimality criteria in reinforcement learning

Jul 03, 2021
Via arXiv

A nearly Blackwell-optimal policy gradient method

Jun 04, 2021
Via arXiv

Average-reward model-free reinforcement learning: a systematic review and literature mapping

Oct 18, 2020
Via arXiv