COMMUNITY-CROSS-INSTRUCT: Unsupervised Instruction Generation for Aligning Large Language Models to Online Communities

He, Zihao; Chu, Minh Duc; Dorn, Rebecca; Guo, Siyi; Lerman, Kristina

Computer Science > Computation and Language

arXiv:2406.12074 (cs)

[Submitted on 17 Jun 2024 (v1), last revised 22 Oct 2024 (this version, v3)]

Title:COMMUNITY-CROSS-INSTRUCT: Unsupervised Instruction Generation for Aligning Large Language Models to Online Communities

Authors:Zihao He, Minh Duc Chu, Rebecca Dorn, Siyi Guo, Kristina Lerman

View PDF HTML (experimental)

Abstract:Social scientists use surveys to probe the opinions and beliefs of populations, but these methods are slow, costly, and prone to biases. Recent advances in large language models (LLMs) enable the creating of computational representations or "digital twins" of populations that generate human-like responses mimicking the population's language, styles, and attitudes. We introduce Community-Cross-Instruct, an unsupervised framework for aligning LLMs to online communities to elicit their beliefs. Given a corpus of a community's online discussions, Community-Cross-Instruct automatically generates instruction-output pairs by an advanced LLM to (1) finetune a foundational LLM to faithfully represent that community, and (2) evaluate the alignment of the finetuned model to the community. We demonstrate the method's utility in accurately representing political and diet communities on Reddit. Unlike prior methods requiring human-authored instructions, Community-Cross-Instruct generates instructions in a fully unsupervised manner, enhancing scalability and generalization across domains. This work enables cost-effective and automated surveying of diverse online communities.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2406.12074 [cs.CL]
	(or arXiv:2406.12074v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2406.12074

Submission history

From: Zihao He [view email]
[v1] Mon, 17 Jun 2024 20:20:47 UTC (550 KB)
[v2] Sun, 6 Oct 2024 22:17:03 UTC (397 KB)
[v3] Tue, 22 Oct 2024 06:38:07 UTC (398 KB)

Computer Science > Computation and Language

Title:COMMUNITY-CROSS-INSTRUCT: Unsupervised Instruction Generation for Aligning Large Language Models to Online Communities

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:COMMUNITY-CROSS-INSTRUCT: Unsupervised Instruction Generation for Aligning Large Language Models to Online Communities

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators