Picture for Patrick Rebeschini

Patrick Rebeschini

University of Oxford

Black-Box Uniform Stability for Non-Euclidean Empirical Risk Minimization

Add code
Dec 20, 2024
Viaarxiv icon

Learning Loss Landscapes in Preference Optimization

Add code
Nov 10, 2024
Viaarxiv icon

Robust Gradient Descent for Phase Retrieval

Add code
Oct 14, 2024
Viaarxiv icon

Differentiable Cost-Parameterized Monge Map Estimators

Add code
Jun 12, 2024
Viaarxiv icon

Meta-learning the mirror map in policy mirror descent

Add code
Feb 07, 2024
Figure 1 for Meta-learning the mirror map in policy mirror descent
Figure 2 for Meta-learning the mirror map in policy mirror descent
Figure 3 for Meta-learning the mirror map in policy mirror descent
Figure 4 for Meta-learning the mirror map in policy mirror descent
Viaarxiv icon

Generalization Bounds for Label Noise Stochastic Gradient Descent

Add code
Nov 01, 2023
Viaarxiv icon

Sample-Efficiency in Multi-Batch Reinforcement Learning: The Need for Dimension-Dependent Adaptivity

Add code
Oct 02, 2023
Viaarxiv icon

Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes

Add code
Feb 22, 2023
Figure 1 for Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes
Figure 2 for Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes
Viaarxiv icon

A Novel Framework for Policy Mirror Descent with General Parametrization and Linear Convergence

Add code
Jan 30, 2023
Viaarxiv icon

Linear Convergence for Natural Policy Gradient with Log-linear Policy Parametrization

Add code
Sep 30, 2022
Viaarxiv icon