An objective function for order preserving hierarchical clustering

Bakkelund, Daniel

Computer Science > Machine Learning

arXiv:2109.04266 (cs)

[Submitted on 9 Sep 2021 (v1), last revised 10 Dec 2024 (this version, v4)]

Title:An objective function for order preserving hierarchical clustering

Authors:Daniel Bakkelund

View PDF

Abstract:We present a theory and an objective function for similarity-based hierarchical clustering of probabilistic partial orders and directed acyclic graphs (DAGs). Specifically, given elements $x \le y$ in the partial order, and their respective clusters $[x]$ and $[y]$, the theory yields an order relation $\le'$ on the clusters such that $[x]\le'[y]$. The theory provides a concise definition of order-preserving hierarchical clustering, and offers a classification theorem identifying the order-preserving trees (dendrograms). To determine the optimal order-preserving trees, we develop an objective function that frames the problem as a bi-objective optimisation, aiming to satisfy both the order relation and the similarity measure. We prove that the optimal trees under the objective are both order-preserving and exhibit high-quality hierarchical clustering. Since finding an optimal solution is NP-hard, we introduce a polynomial-time approximation algorithm and demonstrate that the method outperforms existing methods for order-preserving hierarchical clustering by a significant margin.

Comments:	39 pages
Subjects:	Machine Learning (cs.LG); Combinatorics (math.CO)
MSC classes:	62H30, 06A06
ACM classes:	G.1.2; G.1.6; G.2.2; I.2.6; I.5.3
Cite as:	arXiv:2109.04266 [cs.LG]
	(or arXiv:2109.04266v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.04266

Submission history

From: Daniel Bakkelund [view email]
[v1] Thu, 9 Sep 2021 13:35:01 UTC (75 KB)
[v2] Fri, 31 Dec 2021 13:48:11 UTC (84 KB)
[v3] Sun, 1 May 2022 07:32:39 UTC (85 KB)
[v4] Tue, 10 Dec 2024 18:31:50 UTC (84 KB)

Computer Science > Machine Learning

Title:An objective function for order preserving hierarchical clustering

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:An objective function for order preserving hierarchical clustering

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators