Carlo Alfano

Learning Loss Landscapes in Preference Optimization

Nov 10, 2024
Via arXiv

Meta-learning the mirror map in policy mirror descent

Feb 07, 2024
Via arXiv

A Novel Framework for Policy Mirror Descent with General Parametrization and Linear Convergence

Jan 30, 2023
Via arXiv

Linear Convergence for Natural Policy Gradient with Log-linear Policy Parametrization

Sep 30, 2022
Via arXiv

Dimension-Free Rates for Natural Policy Gradient in Multi-Agent Reinforcement Learning

Sep 23, 2021
Via arXiv