Picture for Francesco Emanuele Stradi

Francesco Emanuele Stradi

Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization

Add code
Oct 03, 2024
Viaarxiv icon

Best-of-Both-Worlds Policy Optimization for CMDPs with Bandit Feedback

Add code
Oct 03, 2024
Viaarxiv icon

A Primal-Dual Online Learning Approach for Dynamic Pricing of Sequentially Displayed Complementary Items under Sale Constraints

Add code
Jul 08, 2024
Viaarxiv icon

Learning Constrained Markov Decision Processes With Non-stationary Rewards and Constraints

Add code
May 23, 2024
Viaarxiv icon

Learning Adversarial MDPs with Stochastic Hard Constraints

Add code
Mar 06, 2024
Viaarxiv icon

Markov Persuasion Processes: Learning to Persuade from Scratch

Add code
Feb 05, 2024
Viaarxiv icon

A Best-of-Both-Worlds Algorithm for Constrained MDPs with Long-Term Constraints

Add code
Apr 27, 2023
Viaarxiv icon