Picture for Craig Boutilier

Craig Boutilier

University of Toronto

Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models

Add code
Dec 18, 2024
Viaarxiv icon

Personalized and Sequential Text-to-Image Generation

Add code
Dec 10, 2024
Viaarxiv icon

Minimizing Live Experiments in Recommender Systems: User Simulation to Evaluate Preference Elicitation Policies

Add code
Sep 26, 2024
Figure 1 for Minimizing Live Experiments in Recommender Systems: User Simulation to Evaluate Preference Elicitation Policies
Figure 2 for Minimizing Live Experiments in Recommender Systems: User Simulation to Evaluate Preference Elicitation Policies
Figure 3 for Minimizing Live Experiments in Recommender Systems: User Simulation to Evaluate Preference Elicitation Policies
Figure 4 for Minimizing Live Experiments in Recommender Systems: User Simulation to Evaluate Preference Elicitation Policies
Viaarxiv icon

Embedding-Aligned Language Models

Add code
May 24, 2024
Viaarxiv icon

DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning

Add code
Feb 25, 2024
Viaarxiv icon

Density-based User Representation through Gaussian Process Regression for Multi-interest Personalized Retrieval

Add code
Nov 15, 2023
Viaarxiv icon

Preference Elicitation with Soft Attributes in Interactive Recommendation

Add code
Oct 22, 2023
Figure 1 for Preference Elicitation with Soft Attributes in Interactive Recommendation
Figure 2 for Preference Elicitation with Soft Attributes in Interactive Recommendation
Figure 3 for Preference Elicitation with Soft Attributes in Interactive Recommendation
Figure 4 for Preference Elicitation with Soft Attributes in Interactive Recommendation
Viaarxiv icon

Factual and Personalized Recommendations using Language Models and Reinforcement Learning

Add code
Oct 09, 2023
Figure 1 for Factual and Personalized Recommendations using Language Models and Reinforcement Learning
Figure 2 for Factual and Personalized Recommendations using Language Models and Reinforcement Learning
Figure 3 for Factual and Personalized Recommendations using Language Models and Reinforcement Learning
Figure 4 for Factual and Personalized Recommendations using Language Models and Reinforcement Learning
Viaarxiv icon

Demystifying Embedding Spaces using Large Language Models

Add code
Oct 06, 2023
Viaarxiv icon

Modeling Recommender Ecosystems: Research Challenges at the Intersection of Mechanism Design, Reinforcement Learning and Generative Models

Add code
Sep 22, 2023
Viaarxiv icon