MReD: A Meta-Review Dataset for Structure-Controllable Text Generation

Shen, Chenhui; Cheng, Liying; Zhou, Ran; Bing, Lidong; You, Yang; Si, Luo

Computer Science > Computation and Language

arXiv:2110.07474 (cs)

[Submitted on 14 Oct 2021 (v1), last revised 5 Jul 2022 (this version, v6)]

Title:MReD: A Meta-Review Dataset for Structure-Controllable Text Generation

Authors:Chenhui Shen, Liying Cheng, Ran Zhou, Lidong Bing, Yang You, Luo Si

View PDF

Abstract:When directly using existing text generation datasets for controllable generation, we are facing the problem of not having the domain knowledge and thus the aspects that could be controlled are limited. A typical example is when using CNN/Daily Mail dataset for controllable text summarization, there is no guided information on the emphasis of summary sentences. A more useful text generator should leverage both the input text and the control signal to guide the generation, which can only be built with a deep understanding of the domain knowledge. Motivated by this vision, our paper introduces a new text generation dataset, named MReD. Our new dataset consists of 7,089 meta-reviews and all its 45k meta-review sentences are manually annotated with one of the 9 carefully defined categories, including abstract, strength, decision, etc. We present experimental results on start-of-the-art summarization models, and propose methods for structure-controlled generation with both extractive and abstractive models using our annotated data. By exploring various settings and analyzing the model behavior with respect to the control signal, we demonstrate the challenges of our proposed task and the values of our dataset MReD. Meanwhile, MReD also allows us to have a better understanding of the meta-review domain.

Comments:	15 pages, 5 figures, accepted at ACL 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2110.07474 [cs.CL]
	(or arXiv:2110.07474v6 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2110.07474

Submission history

From: Liying Cheng [view email]
[v1] Thu, 14 Oct 2021 15:48:03 UTC (2,223 KB)
[v2] Thu, 24 Mar 2022 11:36:37 UTC (4,569 KB)
[v3] Mon, 28 Mar 2022 07:02:45 UTC (4,568 KB)
[v4] Mon, 4 Apr 2022 09:47:08 UTC (4,568 KB)
[v5] Mon, 11 Apr 2022 04:07:34 UTC (4,568 KB)
[v6] Tue, 5 Jul 2022 07:43:44 UTC (3,137 KB)

Computer Science > Computation and Language

Title:MReD: A Meta-Review Dataset for Structure-Controllable Text Generation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:MReD: A Meta-Review Dataset for Structure-Controllable Text Generation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators