Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:PersonalLLM: Tailoring LLMs to Individual Preferences

Sep 30, 2024

Thomas P. Zollo, Andrew Wei Tung Siah, Naimeng Ye, Ang Li, Hongseok Namkoong

Figure 1 for PersonalLLM: Tailoring LLMs to Individual Preferences

Figure 2 for PersonalLLM: Tailoring LLMs to Individual Preferences

Figure 3 for PersonalLLM: Tailoring LLMs to Individual Preferences

Figure 4 for PersonalLLM: Tailoring LLMs to Individual Preferences

Share this with someone who'll enjoy it:

Abstract:As LLMs become capable of complex tasks, there is growing potential for personalized interactions tailored to the subtle and idiosyncratic preferences of the user. We present a public benchmark, PersonalLLM, focusing on adapting LLMs to provide maximal benefits for a particular user. Departing from existing alignment benchmarks that implicitly assume uniform preferences, we curate open-ended prompts paired with many high-quality answers over which users would be expected to display heterogeneous latent preferences. Instead of persona-prompting LLMs based on high-level attributes (e.g., user's race or response length), which yields homogeneous preferences relative to humans, we develop a method that can simulate a large user base with diverse preferences from a set of pre-trained reward models. Our dataset and generated personalities offer an innovative testbed for developing personalization algorithms that grapple with continual data sparsity--few relevant feedback from the particular user--by leveraging historical data from other (similar) users. We explore basic in-context learning and meta-learning baselines to illustrate the utility of PersonalLLM and highlight the need for future methodological development. Our dataset is available at https://huggingface.co/datasets/namkoong-lab/PersonalLLM

* 28 pages, 6 figures

View paper on

Share this with someone who'll enjoy it:

Title:PersonalLLM: Tailoring LLMs to Individual Preferences

Paper and Code