Sharan Vaswani

Improving OOD Generalization of Pre-trained Encoders via Aligned Embedding-Space Ensembles

Nov 20, 2024

Fast Convergence of Softmax Policy Mirror Ascent

Nov 18, 2024

Towards Principled, Practical Policy Gradient for Bandits and Tabular MDPs

May 21, 2024

From Inverse Optimization to Feasibility to ERM

Feb 27, 2024

Noise-adaptive (Accelerated) Stochastic Heavy-Ball Momentum

Jan 12, 2024

Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees

May 24, 2023

Target-based Surrogates for Stochastic Optimization

Feb 06, 2023

Improved Policy Optimization for Online Imitation Learning

Jul 29, 2022

Near-Optimal Sample Complexity Bounds for Constrained MDPs

Jun 13, 2022

Towards Painless Policy Optimization for Constrained MDPs

Apr 11, 2022