Scalable and Robust Community Detection with Randomized Sketching

Rahmani, Mostafa; Beckus, Andre; Karimian, Adel; Atia, George

doi:10.1109/TSP.2020.2965818

Computer Science > Social and Information Networks

arXiv:1805.10927 (cs)

[Submitted on 25 May 2018 (v1), last revised 4 Dec 2022 (this version, v4)]

Title:Scalable and Robust Community Detection with Randomized Sketching

Authors:Mostafa Rahmani, Andre Beckus, Adel Karimian, George Atia

View PDF

Abstract:This article explores and analyzes the unsupervised clustering of large partially observed graphs. We propose a scalable and provable randomized framework for clustering graphs generated from the stochastic block model. The clustering is first applied to a sub-matrix of the graph's adjacency matrix associated with a reduced graph sketch constructed using random sampling. Then, the clusters of the full graph are inferred based on the clusters extracted from the sketch using a correlation-based retrieval step. Uniform random node sampling is shown to improve the computational complexity over clustering of the full graph when the cluster sizes are balanced. A new random degree-based node sampling algorithm is presented which significantly improves upon the performance of the clustering algorithm even when clusters are unbalanced. This framework improves the phase transitions for matrix-decomposition-based clustering with regard to computational complexity and minimum cluster size, which are shown to be nearly dimension-free in the low inter-cluster connectivity regime. A third sampling technique is shown to improve balance by randomly sampling nodes based on spatial distribution. We provide analysis and numerical results using a convex clustering algorithm based on matrix completion.

Subjects:	Social and Information Networks (cs.SI); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1805.10927 [cs.SI]
	(or arXiv:1805.10927v4 [cs.SI] for this version)
	https://doi.org/10.48550/arXiv.1805.10927
Journal reference:	IEEE Transactions on Signal Processing, vol. 68, pp. 962-977, 2020
Related DOI:	https://doi.org/10.1109/TSP.2020.2965818

Submission history

From: Andre Beckus [view email]
[v1] Fri, 25 May 2018 17:19:13 UTC (360 KB)
[v2] Wed, 26 Dec 2018 04:52:18 UTC (364 KB)
[v3] Fri, 24 May 2019 04:20:01 UTC (990 KB)
[v4] Sun, 4 Dec 2022 01:21:03 UTC (1,913 KB)

Computer Science > Social and Information Networks

Title:Scalable and Robust Community Detection with Randomized Sketching

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Social and Information Networks

Title:Scalable and Robust Community Detection with Randomized Sketching

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators