Picture for Craig Boutilier

Craig Boutilier

University of Toronto

Minimizing Live Experiments in Recommender Systems: User Simulation to Evaluate Preference Elicitation Policies

Add code
Sep 26, 2024
Figure 1 for Minimizing Live Experiments in Recommender Systems: User Simulation to Evaluate Preference Elicitation Policies
Figure 2 for Minimizing Live Experiments in Recommender Systems: User Simulation to Evaluate Preference Elicitation Policies
Figure 3 for Minimizing Live Experiments in Recommender Systems: User Simulation to Evaluate Preference Elicitation Policies
Figure 4 for Minimizing Live Experiments in Recommender Systems: User Simulation to Evaluate Preference Elicitation Policies
Viaarxiv icon

Embedding-Aligned Language Models

Add code
May 24, 2024
Viaarxiv icon

DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning

Add code
Feb 25, 2024
Viaarxiv icon

Density-based User Representation through Gaussian Process Regression for Multi-interest Personalized Retrieval

Add code
Nov 15, 2023
Viaarxiv icon

Preference Elicitation with Soft Attributes in Interactive Recommendation

Add code
Oct 22, 2023
Viaarxiv icon

Factual and Personalized Recommendations using Language Models and Reinforcement Learning

Add code
Oct 09, 2023
Viaarxiv icon

Demystifying Embedding Spaces using Large Language Models

Add code
Oct 06, 2023
Viaarxiv icon

Modeling Recommender Ecosystems: Research Challenges at the Intersection of Mechanism Design, Reinforcement Learning and Generative Models

Add code
Sep 22, 2023
Viaarxiv icon

Content Prompting: Modeling Content Provider Dynamics to Improve User Welfare in Recommender Ecosystems

Add code
Sep 02, 2023
Viaarxiv icon

DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models

Add code
May 25, 2023
Viaarxiv icon