Picture for Esther Derman

Esther Derman

Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis

Add code
Oct 31, 2024
Viaarxiv icon

Tree Search-Based Policy Optimization under Stochastic Execution Delay

Add code
Apr 08, 2024
Viaarxiv icon

Solving Non-Rectangular Reward-Robust MDPs via Frequency Regularization

Add code
Sep 03, 2023
Viaarxiv icon

Twice Regularized Markov Decision Processes: The Equivalence between Robustness and Regularization

Add code
Mar 12, 2023
Viaarxiv icon

Policy Gradient for s-Rectangular Robust Markov Decision Processes

Add code
Jan 31, 2023
Viaarxiv icon

Twice regularized MDPs and the equivalence between robustness and regularization

Add code
Oct 12, 2021
Figure 1 for Twice regularized MDPs and the equivalence between robustness and regularization
Figure 2 for Twice regularized MDPs and the equivalence between robustness and regularization
Figure 3 for Twice regularized MDPs and the equivalence between robustness and regularization
Figure 4 for Twice regularized MDPs and the equivalence between robustness and regularization
Viaarxiv icon

Acting in Delayed Environments with Non-Stationary Markov Policies

Add code
Jan 28, 2021
Figure 1 for Acting in Delayed Environments with Non-Stationary Markov Policies
Figure 2 for Acting in Delayed Environments with Non-Stationary Markov Policies
Figure 3 for Acting in Delayed Environments with Non-Stationary Markov Policies
Figure 4 for Acting in Delayed Environments with Non-Stationary Markov Policies
Viaarxiv icon

Distributional Robustness and Regularization in Reinforcement Learning

Add code
Mar 05, 2020
Viaarxiv icon

A Bayesian Approach to Robust Reinforcement Learning

Add code
May 20, 2019
Figure 1 for A Bayesian Approach to Robust Reinforcement Learning
Figure 2 for A Bayesian Approach to Robust Reinforcement Learning
Figure 3 for A Bayesian Approach to Robust Reinforcement Learning
Figure 4 for A Bayesian Approach to Robust Reinforcement Learning
Viaarxiv icon

Soft-Robust Actor-Critic Policy-Gradient

Add code
Oct 24, 2018
Figure 1 for Soft-Robust Actor-Critic Policy-Gradient
Figure 2 for Soft-Robust Actor-Critic Policy-Gradient
Viaarxiv icon