Picture for João Carvalho

João Carvalho

An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients

Add code
Jul 20, 2021
Figure 1 for An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients
Figure 2 for An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients
Figure 3 for An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients
Figure 4 for An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients
Viaarxiv icon

Batch Reinforcement Learning with a Nonparametric Off-Policy Policy Gradient

Add code
Oct 29, 2020
Figure 1 for Batch Reinforcement Learning with a Nonparametric Off-Policy Policy Gradient
Figure 2 for Batch Reinforcement Learning with a Nonparametric Off-Policy Policy Gradient
Figure 3 for Batch Reinforcement Learning with a Nonparametric Off-Policy Policy Gradient
Figure 4 for Batch Reinforcement Learning with a Nonparametric Off-Policy Policy Gradient
Viaarxiv icon

Subspace Segmentation by Successive Approximations: A Method for Low-Rank and High-Rank Data with Missing Entries

Add code
Sep 05, 2017
Figure 1 for Subspace Segmentation by Successive Approximations: A Method for Low-Rank and High-Rank Data with Missing Entries
Figure 2 for Subspace Segmentation by Successive Approximations: A Method for Low-Rank and High-Rank Data with Missing Entries
Figure 3 for Subspace Segmentation by Successive Approximations: A Method for Low-Rank and High-Rank Data with Missing Entries
Figure 4 for Subspace Segmentation by Successive Approximations: A Method for Low-Rank and High-Rank Data with Missing Entries
Viaarxiv icon

Understanding People Flow in Transportation Hubs

Add code
Apr 28, 2017
Figure 1 for Understanding People Flow in Transportation Hubs
Figure 2 for Understanding People Flow in Transportation Hubs
Figure 3 for Understanding People Flow in Transportation Hubs
Figure 4 for Understanding People Flow in Transportation Hubs
Viaarxiv icon