Shapley Homology: Topological Analysis of Sample Influence for Neural Networks

Zhang, Kaixuan; Wang, Qinglong; Liu, Xue; Giles, C. Lee


arXiv:1910.06509 (cs)

[Submitted on 15 Oct 2019]



Abstract: Data samples collected for training machine learning models are typically assumed to be independent and identically distributed (iid). Recent research has demonstrated that this assumption can be problematic because it simplifies the manifold of structured data. This has motivated research in areas such as data poisoning, model improvement, and the explanation of machine learning models. In this work, we study the influence of a sample on determining the intrinsic topological features of its underlying manifold. We propose the Shapley Homology framework, which provides a quantitative metric for the influence of a sample on the homology of a simplicial complex. By interpreting the influence as a probability measure, we further define an entropy that reflects the complexity of the data manifold. Our empirical studies use the 0-dimensional homology and show two things: on neighboring graphs, samples with higher influence scores have more impact on the accuracy of neural networks in determining graph connectivity; and on several regular grammars, higher entropy values imply more difficulty in being learned.
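The abstract does not spell out the definitions, so the following is only a minimal sketch of the idea under stated assumptions, not the authors' implementation. It assumes the simplicial complex is a plain graph, that the characteristic function of a vertex subset is its 0-dimensional Betti number (the number of connected components of the induced subgraph), that influence is the absolute Shapley value normalized to sum to one, and that the entropy is the Shannon entropy of that distribution. The function names `betti0`, `shapley_values`, and `influence_and_entropy` are hypothetical, and the exact enumeration is exponential in the number of vertices, so it only suits very small graphs.

```python
from itertools import combinations
from math import factorial, log


def betti0(vertices, edges):
    """Count connected components of the subgraph induced by `vertices`."""
    parent = {v: v for v in vertices}

    def find(v):
        # Union-find lookup with path halving.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    components = len(parent)
    for a, b in edges:
        if a in parent and b in parent:
            ra, rb = find(a), find(b)
            if ra != rb:
                parent[ra] = rb
                components -= 1
    return components


def shapley_values(vertices, edges):
    """Exact Shapley value of each vertex, with betti0 as the (assumed) characteristic function."""
    n = len(vertices)
    values = {}
    for v in vertices:
        others = [u for u in vertices if u != v]
        total = 0.0
        for k in range(n):
            # Standard Shapley weight for coalitions of size k.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                gain = betti0(subset + (v,), edges) - betti0(subset, edges)
                total += weight * gain
        values[v] = total
    return values


def influence_and_entropy(vertices, edges):
    """Normalize absolute Shapley values to a distribution (an assumption) and return its entropy."""
    phi = shapley_values(vertices, edges)
    norm = sum(abs(x) for x in phi.values())
    influence = {v: abs(x) / norm for v, x in phi.items()}
    entropy = -sum(p * log(p) for p in influence.values() if p > 0)
    return influence, entropy


if __name__ == "__main__":
    # Path graph 0 - 1 - 2.
    influence, entropy = influence_and_entropy([0, 1, 2], [(0, 1), (1, 2)])
    print(influence, entropy)
```

Under these assumptions, the two endpoints of the three-vertex path each receive influence 0.5 and the middle vertex receives 0, because its positive and negative marginal contributions to the component count cancel. Whether to take absolute values before normalizing is a design choice of this sketch; other normalizations would give different scores.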

Subjects: Machine Learning (cs.LG); Algebraic Topology (math.AT); Machine Learning (stat.ML)
Cite as: arXiv:1910.06509 [cs.LG] (or arXiv:1910.06509v1 [cs.LG] for this version)
DOI: https://doi.org/10.48550/arXiv.1910.06509

Submission history

From: Kaixuan Zhang
[v1] Tue, 15 Oct 2019 03:40:45 UTC (1,815 KB)
