Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs

Zhang, Zhaowei; Bai, Fengshuo; Chen, Qizhi; Ma, Chengdong; Wang, Mingzhi; Sun, Haoran; Zheng, Zilong; Yang, Yaodong

arXiv:2502.19148 (cs)

[Submitted on 26 Feb 2025]

Abstract: How to align large language models (LLMs) with user preferences from a static general dataset has been frequently studied. However, user preferences are usually personalized, changing, and diverse, varying with culture, values, and time. As a result, in practical use of LLMs, actual user preferences often do not coincide with those the model developers trained for. Since we cannot collect enough data and retrain for every demand, it is important to research efficient real-time preference adaptation methods that operate on the backbone LLMs at test time. To this end, we introduce Amulet, a novel, training-free framework that formulates the decoding process of every token as a separate online learning problem guided by simple user-provided prompts, thus enabling real-time optimization to satisfy users' personalized preferences. To reduce the computational cost that this per-token optimization introduces, we additionally provide a closed-form solution for each iteration step of the optimization process, thereby reducing the computational time cost to a negligible level. Detailed experimental results demonstrate that Amulet achieves significant performance improvements across rich settings that combine different LLMs, datasets, and user preferences, while maintaining acceptable computational efficiency.
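
The abstract does not state Amulet's actual objective or update rule, so the following is only a generic sketch of the idea of per-token online optimization with a closed-form step, not the paper's method. It assumes a KL-regularized step, argmax over pi of <pi, r> - (1/eta) KL(pi || pi_t), whose closed-form solution is pi_{t+1}(a) proportional to pi_t(a) * exp(eta * r(a)). The reward r (the log-ratio between next-token distributions computed with and without the user's preference prompt), the function names, and the parameters steps and eta are all hypothetical choices made for illustration.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def realign_token_distribution(base_logits, pref_logits, steps=5, eta=0.5):
    """Illustrative per-token update (an assumption, not Amulet's rule).

    base_logits: next-token logits without the preference prompt.
    pref_logits: next-token logits with the preference prompt.
    Each iteration applies the closed-form solution of a KL-regularized
    step: pi_{t+1}(a) is proportional to pi_t(a) * exp(eta * r(a)).
    """
    log_base = log_softmax(base_logits)
    log_pref = log_softmax(pref_logits)
    # Hypothetical reward: how much the preference prompt raises each token.
    reward = [lp - lb for lp, lb in zip(log_pref, log_base)]
    log_policy = log_pref  # start from the prompted distribution
    for _ in range(steps):
        # Closed-form step in log space, then renormalize.
        log_policy = log_softmax(
            [lp + eta * r for lp, r in zip(log_policy, reward)]
        )
    return [math.exp(lp) for lp in log_policy]


if __name__ == "__main__":
    # Toy vocabulary of three tokens; the prompt favors token 0.
    probs = realign_token_distribution([1.0, 1.0, 1.0], [2.0, 1.0, 0.0])
    print(probs)
```

Because every step is a closed-form reweighting followed by a normalization, the per-token overhead in this sketch is a few passes over the vocabulary, which is the kind of cost profile the abstract describes as negligible.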

Comments:	Accepted by ICLR 2025, Project page: this https URL
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
ACM classes:	I.2.7
Cite as:	arXiv:2502.19148 [cs.CL]
	(or arXiv:2502.19148v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.19148

Submission history

From: Zhaowei Zhang
[v1] Wed, 26 Feb 2025 14:07:37 UTC (594 KB)
