Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Emmanuel Barajas Gonzalez

PEARL: Personalizing Large Language Model Writing Assistants with Generation-Calibrated Retrievers

Nov 15, 2023

Sheshera Mysore, Zhuoran Lu, Mengting Wan, Longqi Yang, Steve Menezes, Tina Baghaee, Emmanuel Barajas Gonzalez, Jennifer Neville, Tara Safavi

Figure 1 for PEARL: Personalizing Large Language Model Writing Assistants with Generation-Calibrated Retrievers

Figure 2 for PEARL: Personalizing Large Language Model Writing Assistants with Generation-Calibrated Retrievers

Figure 3 for PEARL: Personalizing Large Language Model Writing Assistants with Generation-Calibrated Retrievers

Figure 4 for PEARL: Personalizing Large Language Model Writing Assistants with Generation-Calibrated Retrievers

Abstract:Powerful large language models have facilitated the development of writing assistants that promise to significantly improve the quality and efficiency of composition and communication. However, a barrier to effective assistance is the lack of personalization in LLM outputs to the author's communication style and specialized knowledge. In this paper, we address this challenge by proposing PEARL, a retrieval-augmented LLM writing assistant personalized with a generation-calibrated retriever. Our retriever is trained to select historic user-authored documents for prompt augmentation, such that they are likely to best personalize LLM generations for a user request. We propose two key novelties for training our retriever: 1) A training data selection method that identifies user requests likely to benefit from personalization and documents that provide that benefit; and 2) A scale-calibrating KL-divergence objective that ensures that our retriever closely tracks the benefit of a document for personalized generation. We demonstrate the effectiveness of PEARL in generating personalized workplace social media posts and Reddit comments. Finally, we showcase the potential of a generation-calibrated retriever to double as a performance predictor and further improve low-quality generations via LLM chaining.

* Pre-print, work in progress

Via

Access Paper or Ask Questions