Picture for Patrick Rebeschini

Patrick Rebeschini

University of Oxford

Robust Gradient Descent for Phase Retrieval

Add code
Oct 14, 2024
Viaarxiv icon

Differentiable Cost-Parameterized Monge Map Estimators

Add code
Jun 12, 2024
Viaarxiv icon

Meta-learning the mirror map in policy mirror descent

Add code
Feb 07, 2024
Figure 1 for Meta-learning the mirror map in policy mirror descent
Figure 2 for Meta-learning the mirror map in policy mirror descent
Figure 3 for Meta-learning the mirror map in policy mirror descent
Figure 4 for Meta-learning the mirror map in policy mirror descent
Viaarxiv icon

Generalization Bounds for Label Noise Stochastic Gradient Descent

Add code
Nov 01, 2023
Viaarxiv icon

Sample-Efficiency in Multi-Batch Reinforcement Learning: The Need for Dimension-Dependent Adaptivity

Add code
Oct 02, 2023
Viaarxiv icon

Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes

Add code
Feb 22, 2023
Figure 1 for Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes
Figure 2 for Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes
Viaarxiv icon

A Novel Framework for Policy Mirror Descent with General Parametrization and Linear Convergence

Add code
Jan 30, 2023
Viaarxiv icon

Linear Convergence for Natural Policy Gradient with Log-linear Policy Parametrization

Add code
Sep 30, 2022
Viaarxiv icon

Exponential Tail Local Rademacher Complexity Risk Bounds Without the Bernstein Condition

Add code
Feb 23, 2022
Viaarxiv icon

Time-independent Generalization Bounds for SGLD in Non-convex Settings

Add code
Nov 25, 2021
Figure 1 for Time-independent Generalization Bounds for SGLD in Non-convex Settings
Viaarxiv icon