Csaba Szepesvari

Stochastic Gradient Succeeds for Bandits

Feb 27, 2024
Via arXiv

Sample Efficient Deep Reinforcement Learning via Local Planning

Jan 29, 2023
Via arXiv

The Role of Baselines in Policy Gradient Optimization

Jan 16, 2023
Via arXiv

Optimistic MLE -- A Generic Model-based Algorithm for Partially Observable Sequential Decision Making

Sep 29, 2022
Via arXiv

Towards Painless Policy Optimization for Constrained MDPs

Apr 11, 2022
Via arXiv

Understanding the Effect of Stochasticity in Policy Optimization

Oct 29, 2021
Via arXiv

On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data

Jun 18, 2021
Via arXiv

On Multi-objective Policy Optimization as a Tool for Reinforcement Learning

Jun 15, 2021
Via arXiv

Leveraging Non-uniformity in First-order Non-convex Optimization

May 13, 2021
Via arXiv

On the Optimality of Batch Policy Optimization Algorithms

Apr 06, 2021
Via arXiv