Unbiased and Efficient Sampling of Dependency Trees

Stanojević, Miloš

Computer Science > Computation and Language

arXiv:2205.12621 (cs)

[Submitted on 25 May 2022 (v1), last revised 28 Nov 2022 (this version, v2)]

Title:Unbiased and Efficient Sampling of Dependency Trees

Authors:Miloš Stanojević

View PDF

Abstract:Most computational models of dependency syntax consist of distributions over spanning trees. However, the majority of dependency treebanks require that every valid dependency tree has a single edge coming out of the ROOT node, a constraint that is not part of the definition of spanning trees. For this reason all standard inference algorithms for spanning trees are suboptimal for inference over dependency trees.
Zmigrod et al. (2021b) proposed algorithms for sampling with and without replacement from the dependency tree distribution that incorporate the single-root constraint. In this paper we show that their fastest algorithm for sampling with replacement, Wilson-RC, is in fact producing biased samples and we provide two alternatives that are unbiased. Additionally, we propose two algorithms (one incremental, one parallel) that reduce the asymptotic runtime of algorithm for sampling k trees without replacement to O(kn3). These algorithms are both asymptotically and practically more efficient.

Comments:	16 pages, 4 algorithms, 7 figures
Subjects:	Computation and Language (cs.CL); Data Structures and Algorithms (cs.DS)
Cite as:	arXiv:2205.12621 [cs.CL]
	(or arXiv:2205.12621v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2205.12621

Submission history

From: Miloš Stanojević [view email]
[v1] Wed, 25 May 2022 09:57:28 UTC (224 KB)
[v2] Mon, 28 Nov 2022 14:02:59 UTC (264 KB)

Computer Science > Computation and Language

Title:Unbiased and Efficient Sampling of Dependency Trees

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Unbiased and Efficient Sampling of Dependency Trees

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators