Shuffled Graph Classification: Theory and Connectome Applications

Vogelstein, Joshua T.; Priebe, Carey E.

arXiv:1112.5506v1 (q-bio)

[Submitted on 23 Dec 2011 (this version), latest version 16 Oct 2012 (v2)]

Abstract: We investigate the extent to which shuffling vertex labels can hinder graph classification performance, and for which random graph models one might expect this shuffling to matter. We establish several results theoretically. First, if one "shuffles" the graphs prior to classification, the vertex label information is irretrievably lost, which can (and often does) degrade classification performance. Second, a specific graph-invariant classifier is shown to be Bayes optimal, and this classifier may be induced from training data consistently and efficiently. Unfortunately, both computational and sample size burdens make this "plugin" classifier impractical. A graph-matched Frobenius norm k nearest neighbor (kNN) classifier, however, is also universally consistent as the number of training samples goes to infinity, and is computationally tractable. Finally, we apply this approach to a real connectome classification problem (a connectome is a brain-graph in which vertices correspond to neurons or collections of neurons). There, the graph-matched kNN classifier on the shuffled graphs performs better than a typical graph-invariant kNN strategy, but not quite as well as kNN on the labeled graphs. Thus, we demonstrate the practical utility of the theoretical derivations herein. Extending these results to weighted and (certain) attributed random graph models is straightforward.
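
To make the idea of a graph-matched Frobenius norm kNN classifier concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not say how graph matching is carried out, so this sketch substitutes an exact brute-force search over all vertex permutations, which is only feasible for very small graphs. The function names (`shuffle_graph`, `matched_distance`, `knn_classify`) and the toy path/star graphs are hypothetical and chosen only for illustration.

```python
from itertools import permutations
from collections import Counter


def shuffle_graph(adj, perm):
    """Relabel vertices: vertex i of the result is vertex perm[i] of adj."""
    n = len(adj)
    return [[adj[perm[i]][perm[j]] for j in range(n)] for i in range(n)]


def matched_distance(a, b):
    """Squared Frobenius distance between adjacency matrices a and b,
    minimized over all vertex relabelings of b.

    Assumption: exact brute force over n! permutations stands in for
    whatever graph-matching routine one would use in practice.
    """
    n = len(a)
    best = None
    for perm in permutations(range(n)):
        d = sum((a[i][j] - b[perm[i]][perm[j]]) ** 2
                for i in range(n) for j in range(n))
        if best is None or d < best:
            best = d
    return best


def knn_classify(test_graph, train_graphs, train_labels, k=1):
    """Majority vote among the k training graphs closest in matched distance."""
    order = sorted(
        range(len(train_graphs)),
        key=lambda idx: matched_distance(test_graph, train_graphs[idx]),
    )
    votes = Counter(train_labels[idx] for idx in order[:k])
    return votes.most_common(1)[0][0]


# Toy 4-vertex graphs (hypothetical data): class 0 is a path, class 1 is a star.
path = [[0, 1, 0, 0], [1, 0, 1, 0], [0, 1, 0, 1], [0, 0, 1, 0]]
star = [[0, 1, 1, 1], [1, 0, 0, 0], [1, 0, 0, 0], [1, 0, 0, 0]]

# Shuffle the star's vertex labels, then classify the shuffled graph.
shuffled_star = shuffle_graph(star, (2, 0, 3, 1))
predicted = knn_classify(shuffled_star, [path, star], [0, 1], k=1)
```

Because the distance is minimized over relabelings, the shuffled star is at distance zero from the unshuffled star, so the nearest neighbor is found despite the lost vertex labels. For graphs of realistic size the factorial search would have to be replaced by an approximate graph-matching method.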

Comments:	7 pages, 1 figure
Subjects:	Quantitative Methods (q-bio.QM); Statistics Theory (math.ST)
Cite as:	arXiv:1112.5506 [q-bio.QM]
	(or arXiv:1112.5506v1 [q-bio.QM] for this version)
	https://doi.org/10.48550/arXiv.1112.5506

Submission history

From: Joshua Vogelstein
[v1] Fri, 23 Dec 2011 02:47:31 UTC (28 KB)
[v2] Tue, 16 Oct 2012 09:57:12 UTC (25 KB)
