A Benchmark for Understanding and Generating Dialogue between Characters in Stories

Yao, Jianzhu; Liu, Ziqi; Guan, Jian; Huang, Minlie

Computer Science > Computation and Language

arXiv:2209.08524 (cs)

[Submitted on 18 Sep 2022 (v1), last revised 12 Dec 2022 (this version, v2)]

Title:A Benchmark for Understanding and Generating Dialogue between Characters in Stories

Authors:Jianzhu Yao, Ziqi Liu, Jian Guan, Minlie Huang

View PDF

Abstract:Many classical fairy tales, fiction, and screenplays leverage dialogue to advance story plots and establish characters. We present the first study to explore whether machines can understand and generate dialogue in stories, which requires capturing traits of different characters and the relationships between them. To this end, we propose two new tasks including Masked Dialogue Generation and Dialogue Speaker Recognition, i.e., generating missing dialogue turns and predicting speakers for specified dialogue turns, respectively. We build a new dataset DialStory, which consists of 105k Chinese stories with a large amount of dialogue weaved into the plots to support the evaluation. We show the difficulty of the proposed tasks by testing existing models with automatic and manual evaluation on DialStory. Furthermore, we propose to learn explicit character representations to improve performance on these tasks. Extensive experiments and case studies show that our approach can generate more coherent and informative dialogue, and achieve higher speaker recognition accuracy than strong baselines.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2209.08524 [cs.CL]
	(or arXiv:2209.08524v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2209.08524

Submission history

From: Jianzhu Yao [view email]
[v1] Sun, 18 Sep 2022 10:19:04 UTC (17,857 KB)
[v2] Mon, 12 Dec 2022 02:32:09 UTC (24,905 KB)

Computer Science > Computation and Language

Title:A Benchmark for Understanding and Generating Dialogue between Characters in Stories

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:A Benchmark for Understanding and Generating Dialogue between Characters in Stories

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators