Hypergraph Spectral Clustering in the Weighted Stochastic Block Model

Ahn, Kwangjun; Lee, Kangwook; Suh, Changho

doi:10.1109/JSTSP.2018.2837638

Mathematics > Statistics Theory

arXiv:1805.08956 (math)

[Submitted on 23 May 2018]

Title:Hypergraph Spectral Clustering in the Weighted Stochastic Block Model

Authors:Kwangjun Ahn, Kangwook Lee, Changho Suh

View PDF

Abstract:Spectral clustering is a celebrated algorithm that partitions objects based on pairwise similarity information. While this approach has been successfully applied to a variety of domains, it comes with limitations. The reason is that there are many other applications in which only \emph{multi}-way similarity measures are available. This motivates us to explore the multi-way measurement setting. In this work, we develop two algorithms intended for such setting: Hypergraph Spectral Clustering (HSC) and Hypergraph Spectral Clustering with Local Refinement (HSCLR). Our main contribution lies in performance analysis of the poly-time algorithms under a random hypergraph model, which we name the weighted stochastic block model, in which objects and multi-way measures are modeled as nodes and weights of hyperedges, respectively. Denoting by $n$ the number of nodes, our analysis reveals the following: (1) HSC outputs a partition which is better than a random guess if the sum of edge weights (to be explained later) is $\Omega(n)$; (2) HSC outputs a partition which coincides with the hidden partition except for a vanishing fraction of nodes if the sum of edge weights is $\omega(n)$; and (3) HSCLR exactly recovers the hidden partition if the sum of edge weights is on the order of $n \log n$. Our results improve upon the state of the arts recently established under the model and they firstly settle the order-wise optimal results for the binary edge weight case. Moreover, we show that our results lead to efficient sketching algorithms for subspace clustering, a computer vision application. Lastly, we show that HSCLR achieves the information-theoretic limits for a special yet practically relevant model, thereby showing no computational barrier for the case.

Comments:	16 pages; 3 figures
Subjects:	Statistics Theory (math.ST); Information Theory (cs.IT); Machine Learning (stat.ML)
Cite as:	arXiv:1805.08956 [math.ST]
	(or arXiv:1805.08956v1 [math.ST] for this version)
	https://doi.org/10.48550/arXiv.1805.08956
Journal reference:	October 2018 special issue on "Information-Theoretic Methods in Data Acquisition, Analysis, and Processing" of the IEEE Journal of Selected Topics in Signal Processing
Related DOI:	https://doi.org/10.1109/JSTSP.2018.2837638

Submission history

From: Kwangjun Ahn [view email]
[v1] Wed, 23 May 2018 04:26:35 UTC (160 KB)

Mathematics > Statistics Theory

Title:Hypergraph Spectral Clustering in the Weighted Stochastic Block Model

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Statistics Theory

Title:Hypergraph Spectral Clustering in the Weighted Stochastic Block Model

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators