Sheryl Hsu

FSPO: Few-Shot Preference Optimization of Synthetic Preference Data in LLMs Elicits Effective Personalization to Real Users

Feb 26, 2025
Via arXiv

Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval

Oct 31, 2024
Via arXiv

RLVF: Learning from Verbal Feedback without Overgeneralization

Feb 16, 2024
Via arXiv

The Power of Many: A Physarum Swarm Steiner Tree Algorithm

Oct 15, 2021
Via arXiv