Picture for Francesco Emanuele Stradi

Francesco Emanuele Stradi

Best-of-Both-Worlds Policy Optimization for CMDPs with Bandit Feedback

Add code
Oct 03, 2024
Viaarxiv icon

Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization

Add code
Oct 03, 2024
Viaarxiv icon

A Primal-Dual Online Learning Approach for Dynamic Pricing of Sequentially Displayed Complementary Items under Sale Constraints

Add code
Jul 08, 2024
Viaarxiv icon

Learning Constrained Markov Decision Processes With Non-stationary Rewards and Constraints

Add code
May 23, 2024
Viaarxiv icon

Learning Adversarial MDPs with Stochastic Hard Constraints

Add code
Mar 06, 2024
Viaarxiv icon

Markov Persuasion Processes: Learning to Persuade from Scratch

Add code
Feb 05, 2024
Viaarxiv icon

A Best-of-Both-Worlds Algorithm for Constrained MDPs with Long-Term Constraints

Add code
Apr 27, 2023
Viaarxiv icon