Luca Viano

Multi-agent imitation learning with function approximation: Linear Markov games and beyond

Feb 26, 2026

Provably avoiding over-optimization in Direct Preference Optimization without knowing the data distribution

Feb 05, 2026

Direct Preference Optimization with Rating Information: Practical Algorithms and Provable Gains

Jan 31, 2026

Inverse Q-Learning Done Right: Offline Imitation Learning in $Q^π$-Realizable MDPs

May 26, 2025

Learning Equilibria from Data: Provably Efficient Multi-Agent Imitation Learning

May 23, 2025

IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic

Feb 27, 2025

Optimistically Optimistic Exploration for Provably Efficient Infinite-Horizon Reinforcement and Imitation Learning

Feb 19, 2025

Multi-Step Alignment as Markov Games: An Optimistic Online Gradient Descent Approach with Convergence Guarantees

Feb 18, 2025

Best of Both Worlds: Regret Minimization versus Minimax Play

Feb 17, 2025

Imitation Learning in Discounted Linear MDPs without exploration assumptions

May 03, 2024