Global Concept-Based Interpretability for Graph Neural Networks via Neuron Analysis

Xuanyuan, Han; Barbiero, Pietro; Georgiev, Dobrik; Magister, Lucie Charlotte; Lió, Pietro

Computer Science > Machine Learning

arXiv:2208.10609v1 (cs)

[Submitted on 22 Aug 2022 (this version), latest version 8 Mar 2023 (v2)]

Title:Global Concept-Based Interpretability for Graph Neural Networks via Neuron Analysis

Authors:Han Xuanyuan, Pietro Barbiero, Dobrik Georgiev, Lucie Charlotte Magister, Pietro Lió

View PDF

Abstract:Graph neural networks (GNNs) are highly effective on a variety of graph-related tasks; however, they lack interpretability and transparency. Current explainability approaches are typically local and treat GNNs as black-boxes. They do not look inside the model, inhibiting human trust in the model and explanations. Motivated by the ability of neurons to detect high-level semantic concepts in vision models, we perform a novel analysis on the behaviour of individual GNN neurons to answer questions about GNN interpretability, and propose new metrics for evaluating the interpretability of GNN neurons. We propose a novel approach for producing global explanations for GNNs using neuron-level concepts to enable practitioners to have a high-level view of the model. Specifically, (i) to the best of our knowledge, this is the first work which shows that GNN neurons act as concept detectors and have strong alignment with concepts formulated as logical compositions of node degree and neighbourhood properties; (ii) we quantitatively assess the importance of detected concepts, and identify a trade-off between training duration and neuron-level interpretability; (iii) we demonstrate that our global explainability approach has advantages over the current state-of-the-art -- we can disentangle the explanation into individual interpretable concepts backed by logical descriptions, which reduces potential for bias and improves user-friendliness.

Comments:	9 pages, 5 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2208.10609 [cs.LG]
	(or arXiv:2208.10609v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2208.10609

Submission history

From: Han Xuanyuan [view email]
[v1] Mon, 22 Aug 2022 21:30:55 UTC (2,469 KB)
[v2] Wed, 8 Mar 2023 21:10:38 UTC (12,994 KB)

Computer Science > Machine Learning

Title:Global Concept-Based Interpretability for Graph Neural Networks via Neuron Analysis

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Global Concept-Based Interpretability for Graph Neural Networks via Neuron Analysis

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators