On Self-Distilling Graph Neural Network

Chen, Yuzhao; Bian, Yatao; Xiao, Xi; Rong, Yu; Xu, Tingyang; Huang, Junzhou

Computer Science > Machine Learning

arXiv:2011.02255 (cs)

[Submitted on 4 Nov 2020 (v1), last revised 30 Apr 2021 (this version, v2)]

Title:On Self-Distilling Graph Neural Network

Authors:Yuzhao Chen, Yatao Bian, Xi Xiao, Yu Rong, Tingyang Xu, Junzhou Huang

View PDF

Abstract:Recently, the teacher-student knowledge distillation framework has demonstrated its potential in training Graph Neural Networks (GNNs). However, due to the difficulty of training over-parameterized GNN models, one may not easily obtain a satisfactory teacher model for distillation. Furthermore, the inefficient training process of teacher-student knowledge distillation also impedes its applications in GNN models. In this paper, we propose the first teacher-free knowledge distillation method for GNNs, termed GNN Self-Distillation (GNN-SD), that serves as a drop-in replacement of the standard training process. The method is built upon the proposed neighborhood discrepancy rate (NDR), which quantifies the non-smoothness of the embedded graph in an efficient way. Based on this metric, we propose the adaptive discrepancy retaining (ADR) regularizer to empower the transferability of knowledge that maintains high neighborhood discrepancy across GNN layers. We also summarize a generic GNN-SD framework that could be exploited to induce other distillation strategies. Experiments further prove the effectiveness and generalization of our approach, as it brings: 1) state-of-the-art GNN distillation performance with less training cost, 2) consistent and considerable performance enhancement for various popular backbones.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2011.02255 [cs.LG]
	(or arXiv:2011.02255v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2011.02255

Submission history

From: Yuzhao Chen [view email]
[v1] Wed, 4 Nov 2020 12:29:33 UTC (1,100 KB)
[v2] Fri, 30 Apr 2021 04:31:53 UTC (786 KB)

Computer Science > Machine Learning

Title:On Self-Distilling Graph Neural Network

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:On Self-Distilling Graph Neural Network

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators