On the Essence and Prospect: An Investigation of Alignment Approaches for Big Models

Wang, Xinpeng; Duan, Shitong; Yi, Xiaoyuan; Yao, Jing; Zhou, Shanlin; Wei, Zhihua; Zhang, Peng; Xu, Dongkuan; Sun, Maosong; Xie, Xing

Computer Science > Artificial Intelligence

arXiv:2403.04204 (cs)

[Submitted on 7 Mar 2024]

Title:On the Essence and Prospect: An Investigation of Alignment Approaches for Big Models

Authors:Xinpeng Wang, Shitong Duan, Xiaoyuan Yi, Jing Yao, Shanlin Zhou, Zhihua Wei, Peng Zhang, Dongkuan Xu, Maosong Sun, Xing Xie

View PDF HTML (experimental)

Abstract:Big models have achieved revolutionary breakthroughs in the field of AI, but they might also pose potential concerns. Addressing such concerns, alignment technologies were introduced to make these models conform to human preferences and values. Despite considerable advancements in the past year, various challenges lie in establishing the optimal alignment strategy, such as data cost and scalable oversight, and how to align remains an open question. In this survey paper, we comprehensively investigate value alignment approaches. We first unpack the historical context of alignment tracing back to the 1920s (where it comes from), then delve into the mathematical essence of alignment (what it is), shedding light on the inherent challenges. Following this foundation, we provide a detailed examination of existing alignment methods, which fall into three categories: Reinforcement Learning, Supervised Fine-Tuning, and In-context Learning, and demonstrate their intrinsic connections, strengths, and limitations, helping readers better understand this research area. In addition, two emerging topics, personal alignment, and multimodal alignment, are also discussed as novel frontiers in this field. Looking forward, we discuss potential alignment paradigms and how they could handle remaining challenges, prospecting where future alignment will go.

Comments:	23 pages, 7 figures
Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2403.04204 [cs.AI]
	(or arXiv:2403.04204v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2403.04204

Submission history

From: Xiaoyuan Yi [view email]
[v1] Thu, 7 Mar 2024 04:19:13 UTC (5,706 KB)

Computer Science > Artificial Intelligence

Title:On the Essence and Prospect: An Investigation of Alignment Approaches for Big Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:On the Essence and Prospect: An Investigation of Alignment Approaches for Big Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators