ProsocialDialog: A Prosocial Backbone for Conversational Agents

Kim, Hyunwoo; Yu, Youngjae; Jiang, Liwei; Lu, Ximing; Khashabi, Daniel; Kim, Gunhee; Choi, Yejin; Sap, Maarten

arXiv:2205.12688 (cs)

[Submitted on 25 May 2022 (v1), last revised 25 Oct 2022 (this version, v2)]

Abstract: Most existing dialogue systems fail to respond properly to potentially unsafe user utterances by either ignoring or passively agreeing with them. To address this issue, we introduce ProsocialDialog, the first large-scale multi-turn dialogue dataset to teach conversational agents to respond to problematic content following social norms. Covering diverse unethical, problematic, biased, and toxic situations, ProsocialDialog contains responses that encourage prosocial behavior, grounded in commonsense social rules (i.e., rules-of-thumb, RoTs). Created via a human-AI collaborative framework, ProsocialDialog consists of 58K dialogues, with 331K utterances, 160K unique RoTs, and 497K dialogue safety labels accompanied by free-form rationales.
With this dataset, we introduce a dialogue safety detection module, Canary, capable of generating RoTs given conversational context, and a socially-informed dialogue agent, Prost. Empirical results show that Prost generates more socially acceptable dialogues compared to other state-of-the-art language and dialogue models in both in-domain and out-of-domain settings. Additionally, Canary effectively guides conversational agents and off-the-shelf language models to generate significantly more prosocial responses. Our work highlights the promise and importance of creating and steering conversational AI to be socially responsible.

Comments:	EMNLP 2022 camera ready; Dataset and model can be found at this https URL
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2205.12688 [cs.CL]
	(or arXiv:2205.12688v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2205.12688

Submission history

From: Hyunwoo Kim
[v1] Wed, 25 May 2022 11:48:47 UTC (11,151 KB)
[v2] Tue, 25 Oct 2022 08:28:58 UTC (4,048 KB)
