Picture for Alberto Marchesi

Alberto Marchesi

Best-of-Both-Worlds Policy Optimization for CMDPs with Bandit Feedback

Add code
Oct 03, 2024
Viaarxiv icon

Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization

Add code
Oct 03, 2024
Viaarxiv icon

Learning Constrained Markov Decision Processes With Non-stationary Rewards and Constraints

Add code
May 23, 2024
Viaarxiv icon

Learning Adversarial MDPs with Stochastic Hard Constraints

Add code
Mar 06, 2024
Viaarxiv icon

Markov Persuasion Processes: Learning to Persuade from Scratch

Add code
Feb 05, 2024
Viaarxiv icon

Learning Optimal Contracts: How to Exploit Small Action Spaces

Add code
Sep 18, 2023
Viaarxiv icon

A Best-of-Both-Worlds Algorithm for Constrained MDPs with Long-Term Constraints

Add code
Apr 27, 2023
Viaarxiv icon

A Unifying Framework for Online Optimization with Long-Term Constraints

Add code
Sep 15, 2022
Figure 1 for A Unifying Framework for Online Optimization with Long-Term Constraints
Viaarxiv icon

Sequential Information Design: Learning to Persuade in the Dark

Add code
Sep 08, 2022
Figure 1 for Sequential Information Design: Learning to Persuade in the Dark
Figure 2 for Sequential Information Design: Learning to Persuade in the Dark
Figure 3 for Sequential Information Design: Learning to Persuade in the Dark
Figure 4 for Sequential Information Design: Learning to Persuade in the Dark
Viaarxiv icon

Multi-Receiver Online Bayesian Persuasion

Add code
Jun 11, 2021
Viaarxiv icon