Knowledge Distillation Decision Tree for Unravelling Black-box Machine Learning Models

Lu, Xuetao; Lee, J. Jack

Statistics > Methodology

arXiv:2206.04661 (stat)

[Submitted on 9 Jun 2022 (v1), last revised 4 Apr 2025 (this version, v4)]

Title:Knowledge Distillation Decision Tree for Unravelling Black-box Machine Learning Models

Authors:Xuetao Lu, J. Jack Lee

View PDF

Abstract:Machine learning models, particularly the black-box models, are widely favored for their outstanding predictive capabilities. However, they often face scrutiny and criticism due to the lack of interpretability. Paradoxically, their strong predictive capabilities may indicate a deep understanding of the underlying data, implying significant potential for interpretation. Leveraging the emerging concept of knowledge distillation, we introduce the method of knowledge distillation decision tree (KDDT). This method enables the distillation of knowledge about the data from a black-box model into a decision tree, thereby facilitating the interpretation of the black-box model. Essential attributes for a good interpretable model include simplicity, stability, and predictivity. The primary challenge of constructing interpretable tree lies in ensuring structural stability under the randomness of the training data. KDDT is developed with the theoretical foundations demonstrating that structure stability can be achieved under mild assumptions. Furthermore, we propose the hybrid KDDT to achieve both simplicity and predictivity. An efficient algorithm is provided for constructing the hybrid KDDT. Simulation studies and a real-data analysis validate the hybrid KDDT's capability to deliver accurate and reliable interpretations. KDDT is an excellent interpretable model with great potential for practical applications.

Subjects:	Methodology (stat.ME); Machine Learning (cs.LG)
Cite as:	arXiv:2206.04661 [stat.ME]
	(or arXiv:2206.04661v4 [stat.ME] for this version)
	https://doi.org/10.48550/arXiv.2206.04661

Submission history

From: Xuetao Lu [view email]
[v1] Thu, 9 Jun 2022 17:57:37 UTC (3,228 KB)
[v2] Fri, 29 Sep 2023 21:46:00 UTC (6,766 KB)
[v3] Tue, 3 Oct 2023 03:11:20 UTC (6,765 KB)
[v4] Fri, 4 Apr 2025 18:13:02 UTC (9,691 KB)

Statistics > Methodology

Title:Knowledge Distillation Decision Tree for Unravelling Black-box Machine Learning Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Methodology

Title:Knowledge Distillation Decision Tree for Unravelling Black-box Machine Learning Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators