Less is More: A Lightweight and Robust Neural Architecture for Discourse Parsing

Li, Ming; Huang, Ruihong

Computer Science > Computation and Language

arXiv:2210.09537 (cs)

[Submitted on 18 Oct 2022 (v1), last revised 8 Sep 2023 (this version, v2)]

Title:Less is More: A Lightweight and Robust Neural Architecture for Discourse Parsing

Authors:Ming Li, Ruihong Huang

View PDF

Abstract:Complex feature extractors are widely employed for text representation building. However, these complex feature extractors make the NLP systems prone to overfitting especially when the downstream training datasets are relatively small, which is the case for several discourse parsing tasks. Thus, we propose an alternative lightweight neural architecture that removes multiple complex feature extractors and only utilizes learnable self-attention modules to indirectly exploit pretrained neural language models, in order to maximally preserve the generalizability of pre-trained language models. Experiments on three common discourse parsing tasks show that powered by recent pretrained language models, the lightweight architecture consisting of only two self-attention layers obtains much better generalizability and robustness. Meanwhile, it achieves comparable or even better system performance with fewer learnable parameters and less processing time.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2210.09537 [cs.CL]
	(or arXiv:2210.09537v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.09537

Submission history

From: Ming Li [view email]
[v1] Tue, 18 Oct 2022 02:07:09 UTC (7,731 KB)
[v2] Fri, 8 Sep 2023 05:37:35 UTC (10,606 KB)

Computer Science > Computation and Language

Title:Less is More: A Lightweight and Robust Neural Architecture for Discourse Parsing

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Less is More: A Lightweight and Robust Neural Architecture for Discourse Parsing

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators