Picture for Kristopher De Asis

Kristopher De Asis

Value-aware Importance Weighting for Off-policy Reinforcement Learning

Add code
Jun 27, 2023
Viaarxiv icon

Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning

Add code
Sep 09, 2019
Figure 1 for Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
Figure 2 for Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
Figure 3 for Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
Figure 4 for Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
Viaarxiv icon

Predicting Periodicity with Temporal Difference Learning

Add code
Sep 20, 2018
Figure 1 for Predicting Periodicity with Temporal Difference Learning
Figure 2 for Predicting Periodicity with Temporal Difference Learning
Figure 3 for Predicting Periodicity with Temporal Difference Learning
Figure 4 for Predicting Periodicity with Temporal Difference Learning
Viaarxiv icon

Per-decision Multi-step Temporal Difference Learning with Control Variates

Add code
Jul 05, 2018
Figure 1 for Per-decision Multi-step Temporal Difference Learning with Control Variates
Figure 2 for Per-decision Multi-step Temporal Difference Learning with Control Variates
Figure 3 for Per-decision Multi-step Temporal Difference Learning with Control Variates
Figure 4 for Per-decision Multi-step Temporal Difference Learning with Control Variates
Viaarxiv icon

Multi-step Reinforcement Learning: A Unifying Algorithm

Add code
Jun 11, 2018
Figure 1 for Multi-step Reinforcement Learning: A Unifying Algorithm
Figure 2 for Multi-step Reinforcement Learning: A Unifying Algorithm
Figure 3 for Multi-step Reinforcement Learning: A Unifying Algorithm
Figure 4 for Multi-step Reinforcement Learning: A Unifying Algorithm
Viaarxiv icon