Nicola Gatti

Best-of-Both-Worlds Policy Optimization for CMDPs with Bandit Feedback

Oct 03, 2024
Via arXiv

Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization

Oct 03, 2024
Via arXiv

Bridging Rested and Restless Bandits with Graph-Triggering: Rising and Rotting

Sep 09, 2024
Via arXiv

A Primal-Dual Online Learning Approach for Dynamic Pricing of Sequentially Displayed Complementary Items under Sale Constraints

Jul 08, 2024
Via arXiv

Learning Constrained Markov Decision Processes With Non-stationary Rewards and Constraints

May 23, 2024
Via arXiv

Learning Adversarial MDPs with Stochastic Hard Constraints

Mar 06, 2024
Via arXiv

Markov Persuasion Processes: Learning to Persuade from Scratch

Feb 05, 2024
Via arXiv

Towards Fully Adaptive Regret Minimization in Heavy-Tailed Bandits

Oct 04, 2023
Via arXiv

Learning Optimal Contracts: How to Exploit Small Action Spaces

Sep 18, 2023
Via arXiv

A Best-of-Both-Worlds Algorithm for Constrained MDPs with Long-Term Constraints

Apr 27, 2023
Via arXiv