Matteo Castiglioni

Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization

Oct 03, 2024
Via arXiv

Best-of-Both-Worlds Policy Optimization for CMDPs with Bandit Feedback

Oct 03, 2024
Via arXiv

Bridging Rested and Restless Bandits with Graph-Triggering: Rising and Rotting

Sep 09, 2024
Via arXiv

Beyond Primal-Dual Methods in Bandits with Stochastic and Adversarial Constraints

May 25, 2024
Via arXiv

Learning Constrained Markov Decision Processes With Non-stationary Rewards and Constraints

May 23, 2024
Via arXiv

No-Regret is not enough! Bandits with General Constraints through Adaptive Regret Minimization

May 10, 2024
Via arXiv

Learning Adversarial MDPs with Stochastic Hard Constraints

Mar 06, 2024
Via arXiv

Markov Persuasion Processes: Learning to Persuade from Scratch

Feb 05, 2024
Via arXiv

No-Regret Learning in Bilateral Trade via Global Budget Balance

Oct 18, 2023
Via arXiv

Learning Optimal Contracts: How to Exploit Small Action Spaces

Sep 18, 2023
Via arXiv