Picture for Mohammad Ghavamzadeh

Mohammad Ghavamzadeh

INRIA Lille - Nord Europe

Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis

Add code
Oct 31, 2024
Viaarxiv icon

Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models

Add code
Apr 02, 2024
Viaarxiv icon

Contextual Bandits with Stage-wise Constraints

Add code
Jan 15, 2024
Viaarxiv icon

Maximum Entropy Model Correction in Reinforcement Learning

Add code
Nov 29, 2023
Viaarxiv icon

Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage

Add code
Oct 27, 2023
Viaarxiv icon

Preference Elicitation with Soft Attributes in Interactive Recommendation

Add code
Oct 22, 2023
Viaarxiv icon

Factual and Personalized Recommendations using Language Models and Reinforcement Learning

Add code
Oct 09, 2023
Viaarxiv icon

A Convex Relaxation Approach to Bayesian Regret Minimization in Offline Bandits

Add code
Jun 02, 2023
Viaarxiv icon

DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models

Add code
May 25, 2023
Viaarxiv icon

Private and Communication-Efficient Algorithms for Entropy Estimation

Add code
May 12, 2023
Viaarxiv icon