Global explainability of a deep abstaining classifier

Dhaubhadel, Sayera; Mohd-Yusof, Jamaludin; McMahon, Benjamin H.; Estrada, Trilce; Ganguly, Kumkum; Spannaus, Adam; Gounley, John P.; Wu, Xiao-Cheng; Durbin, Eric B.; Hanson, Heidi A.; Bhattacharya, Tanmoy

Computer Science > Machine Learning

arXiv:2504.01202 (cs)

[Submitted on 1 Apr 2025]

Title:Global explainability of a deep abstaining classifier

Authors:Sayera Dhaubhadel (1 and 2), Jamaludin Mohd-Yusof (1), Benjamin H. McMahon (1), Trilce Estrada (2), Kumkum Ganguly (1), Adam Spannaus (3), John P. Gounley (3), Xiao-Cheng Wu (4), Eric B. Durbin (5), Heidi A. Hanson (3), Tanmoy Bhattacharya (1) ((1) Los Alamos National Laboratory, (2) University of New Mexico, (3) Oak Ridge National Laboratory, (4) Louisiana Tumor Registry, (5) Kentucky Cancer Registry)

View PDF HTML (experimental)

Abstract:We present a global explainability method to characterize sources of errors in the histology prediction task of our real-world multitask convolutional neural network (MTCNN)-based deep abstaining classifier (DAC), for automated annotation of cancer pathology reports from NCI-SEER registries. Our classifier was trained and evaluated on 1.04 million hand-annotated samples and makes simultaneous predictions of cancer site, subsite, histology, laterality, and behavior for each report. The DAC framework enables the model to abstain on ambiguous reports and/or confusing classes to achieve a target accuracy on the retained (non-abstained) samples, but at the cost of decreased coverage. Requiring 97% accuracy on the histology task caused our model to retain only 22% of all samples, mostly the less ambiguous and common classes. Local explainability with the GradInp technique provided a computationally efficient way of obtaining contextual reasoning for thousands of individual predictions. Our method, involving dimensionality reduction of approximately 13000 aggregated local explanations, enabled global identification of sources of errors as hierarchical complexity among classes, label noise, insufficient information, and conflicting evidence. This suggests several strategies such as exclusion criteria, focused annotation, and reduced penalties for errors involving hierarchically related classes to iteratively improve our DAC in this complex real-world implementation.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2504.01202 [cs.LG]
	(or arXiv:2504.01202v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2504.01202

Submission history

From: Sayera Dhaubhadel [view email]
[v1] Tue, 1 Apr 2025 21:34:10 UTC (47,541 KB)

Computer Science > Machine Learning

Title:Global explainability of a deep abstaining classifier

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Global explainability of a deep abstaining classifier

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators