Pearl: Personalizing Large Language Model Writing Assistants with Generation-Calibrated Retrievers

Mysore, Sheshera; Lu, Zhuoran; Wan, Mengting; Yang, Longqi; Sarrafzadeh, Bahareh; Menezes, Steve; Baghaee, Tina; Gonzalez, Emmanuel Barajas; Neville, Jennifer; Safavi, Tara

arXiv:2311.09180 (cs)

[Submitted on 15 Nov 2023 (v1), last revised 5 Nov 2024 (this version, v2)]

Abstract: Powerful large language models have facilitated the development of writing assistants that promise to significantly improve the quality and efficiency of composition and communication. However, a barrier to effective assistance is the lack of personalization in LLM outputs to the author's communication style, specialized knowledge, and values. In this paper, we address this challenge by proposing Pearl, an LLM writing assistant personalized with a retriever that is trained to be generation-calibrated for personalization. Generation calibration ensures that our retriever selects historical user-authored documents to augment an LLM prompt, choosing those that are likely to help the LLM's generation better adhere to the user's preferences. We propose two key novelties for training such a retriever: (1) a training data selection method that identifies user requests likely to benefit from personalization and documents that provide that benefit; and (2) a scale-calibrating KL-divergence objective that ensures that our retriever scores remain proportional to the downstream generation quality obtained by using the document for personalized generation. In a series of holistic evaluations, we demonstrate the effectiveness of Pearl in generating long-form texts on multiple social media datasets. Finally, we demonstrate how a generation-calibrated retriever can double as a performance predictor, detecting low-quality retrieval and improving potentially under-performing outputs via revision with LLMs.
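The abstract does not spell out the training objective, so the following is only a minimal sketch of what a KL-divergence objective between retriever scores and downstream generation quality could look like. It is not the authors' implementation. The names `log_softmax`, `calibration_loss`, `quality_gains`, and `target_temperature` are hypothetical, and using a softmax over per-document quality gains as the target distribution is an assumption made for illustration.

```python
import math


def log_softmax(values, temperature=1.0):
    """Numerically stable log-softmax over a list of floats."""
    scaled = [v / temperature for v in values]
    peak = max(scaled)
    log_total = peak + math.log(sum(math.exp(v - peak) for v in scaled))
    return [v - log_total for v in scaled]


def calibration_loss(retriever_scores, quality_gains, target_temperature=1.0):
    """KL(target || predicted) over one request's candidate documents.

    retriever_scores: score the retriever assigns to each candidate document.
    quality_gains: assumed per-document measure of how much the document
        improves the personalized generation (hypothetical input).
    target_temperature: hypothetical knob controlling the scale of the target.
    """
    target_log = log_softmax(quality_gains, target_temperature)
    predicted_log = log_softmax(retriever_scores)
    # KL(p || q) = sum_i p_i * (log p_i - log q_i)
    return sum(
        math.exp(t) * (t - p) for t, p in zip(target_log, predicted_log)
    )


# Scores that order documents the same way as the gains incur less loss
# than scores that reverse the ordering.
aligned = calibration_loss([1.0, 2.0, 3.0], [1.0, 2.0, 3.0])
reversed_order = calibration_loss([3.0, 2.0, 1.0], [1.0, 2.0, 3.0])
print(aligned, reversed_order)
```

In this sketch the loss is zero only when the softmax of the retriever scores matches the target distribution, which is one way of keeping scores tied to downstream quality; the paper's actual scale-calibrating formulation may differ.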

Comments:	Accepted to Workshop on Customizable NLP at EMNLP 2024
Subjects:	Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Information Retrieval (cs.IR)
Cite as:	arXiv:2311.09180 [cs.CL]
	(or arXiv:2311.09180v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2311.09180

Submission history

From: Sheshera Mysore
[v1] Wed, 15 Nov 2023 18:19:58 UTC (1,283 KB)
[v2] Tue, 5 Nov 2024 03:34:10 UTC (1,872 KB)
