ParetoHqD: Fast Offline Multiobjective Alignment of Large Language Models using Pareto High-quality Data

Gu, Haoran; Wang, Handing; Mei, Yi; Zhang, Mengjie; Jin, Yaochu

Computer Science > Machine Learning

arXiv:2504.16628 (cs)

[Submitted on 23 Apr 2025]

Title:ParetoHqD: Fast Offline Multiobjective Alignment of Large Language Models using Pareto High-quality Data

Authors:Haoran Gu, Handing Wang, Yi Mei, Mengjie Zhang, Yaochu Jin

View PDF HTML (experimental)

Abstract:Aligning large language models with multiple human expectations and values is crucial for ensuring that they adequately serve a variety of user needs. To this end, offline multiobjective alignment algorithms such as the Rewards-in-Context algorithm have shown strong performance and efficiency. However, inappropriate preference representations and training with imbalanced reward scores limit the performance of such algorithms. In this work, we introduce ParetoHqD that addresses the above issues by representing human preferences as preference directions in the objective space and regarding data near the Pareto front as ''high-quality'' data. For each preference, ParetoHqD follows a two-stage supervised fine-tuning process, where each stage uses an individual Pareto high-quality training set that best matches its preference direction. The experimental results have demonstrated the superiority of ParetoHqD over five baselines on two multiobjective alignment tasks.

Comments:	19 pages, 6 figure, Multiobjective Alignment of LLMs
Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as:	arXiv:2504.16628 [cs.LG]
	(or arXiv:2504.16628v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2504.16628

Submission history

From: Haoran Gu [view email]
[v1] Wed, 23 Apr 2025 11:35:57 UTC (1,353 KB)

Computer Science > Machine Learning

Title:ParetoHqD: Fast Offline Multiobjective Alignment of Large Language Models using Pareto High-quality Data

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:ParetoHqD: Fast Offline Multiobjective Alignment of Large Language Models using Pareto High-quality Data

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators