Fast and Accurate Neural CRF Constituency Parsing

Zhang, Yu; Zhou, Houquan; Li, Zhenghua

doi:10.24963/ijcai.2020/560

Computer Science > Computation and Language

arXiv:2008.03736 (cs)

[Submitted on 9 Aug 2020]

Title:Fast and Accurate Neural CRF Constituency Parsing

Authors:Yu Zhang, Houquan Zhou, Zhenghua Li

View PDF

Abstract:Estimating probability distribution is one of the core issues in the NLP field. However, in both deep learning (DL) and pre-DL eras, unlike the vast applications of linear-chain CRF in sequence labeling tasks, very few works have applied tree-structure CRF to constituency parsing, mainly due to the complexity and inefficiency of the inside-outside algorithm. This work presents a fast and accurate neural CRF constituency parser. The key idea is to batchify the inside algorithm for loss computation by direct large tensor operations on GPU, and meanwhile avoid the outside algorithm for gradient computation via efficient back-propagation. We also propose a simple two-stage bracketing-then-labeling parsing approach to improve efficiency further. To improve the parsing performance, inspired by recent progress in dependency parsing, we introduce a new scoring architecture based on boundary representation and biaffine attention, and a beneficial dropout strategy. Experiments on PTB, CTB5.1, and CTB7 show that our two-stage CRF parser achieves new state-of-the-art performance on both settings of w/o and w/ BERT, and can parse over 1,000 sentences per second. We release our code at this https URL.

Comments:	IJCAI 2020
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2008.03736 [cs.CL]
	(or arXiv:2008.03736v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2008.03736
Related DOI:	https://doi.org/10.24963/ijcai.2020/560

Submission history

From: Yu Zhang [view email]
[v1] Sun, 9 Aug 2020 14:38:48 UTC (107 KB)

Computer Science > Computation and Language

Title:Fast and Accurate Neural CRF Constituency Parsing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Fast and Accurate Neural CRF Constituency Parsing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators