Yinlam Chow

Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models

Dec 18, 2024
Via arXiv

Personalized and Sequential Text-to-Image Generation

Dec 10, 2024
Via arXiv

Embedding-Aligned Language Models

May 24, 2024
Via arXiv

DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning

Feb 25, 2024
Via arXiv

Preference Elicitation with Soft Attributes in Interactive Recommendation

Oct 22, 2023
Via arXiv

Factual and Personalized Recommendations using Language Models and Reinforcement Learning

Oct 09, 2023
Via arXiv

Demystifying Embedding Spaces using Large Language Models

Oct 06, 2023
Via arXiv

Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management

Feb 21, 2023
Via arXiv

Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning

Jul 25, 2022
Via arXiv

A Mixture-of-Expert Approach to RL-based Dialogue Management

May 31, 2022
Via arXiv