A maximum-entropy approach to off-policy evaluation in average-reward MDPs

Add code
Jun 17, 2020
Figure 1 for A maximum-entropy approach to off-policy evaluation in average-reward MDPs
Figure 2 for A maximum-entropy approach to off-policy evaluation in average-reward MDPs

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: