LibraGrad: Balancing Gradient Flow for Universally Better Vision Transformer Attributions

Mehri, Faridoun; Baghshah, Mahdieh Soleymani; Pilehvar, Mohammad Taher

Computer Science > Computer Vision and Pattern Recognition

arXiv:2411.16760 (cs)

[Submitted on 24 Nov 2024]

Title:LibraGrad: Balancing Gradient Flow for Universally Better Vision Transformer Attributions

Authors:Faridoun Mehri (1), Mahdieh Soleymani Baghshah (1), Mohammad Taher Pilehvar (2) ((1) Sharif University of Technology, (2) Cardiff University)

View PDF

Abstract:Why do gradient-based explanations struggle with Transformers, and how can we improve them? We identify gradient flow imbalances in Transformers that violate FullGrad-completeness, a critical property for attribution faithfulness that CNNs naturally possess. To address this issue, we introduce LibraGrad -- a theoretically grounded post-hoc approach that corrects gradient imbalances through pruning and scaling of backward paths, without changing the forward pass or adding computational overhead. We evaluate LibraGrad using three metric families: Faithfulness, which quantifies prediction changes under perturbations of the most and least relevant features; Completeness Error, which measures attribution conservation relative to model outputs; and Segmentation AP, which assesses alignment with human perception. Extensive experiments across 8 architectures, 4 model sizes, and 4 datasets show that LibraGrad universally enhances gradient-based methods, outperforming existing white-box methods -- including Transformer-specific approaches -- across all metrics. We demonstrate superior qualitative results through two complementary evaluations: precise text-prompted region highlighting on CLIP models and accurate class discrimination between co-occurring animals on ImageNet-finetuned models -- two settings on which existing methods often struggle. LibraGrad is effective even on the attention-free MLP-Mixer architecture, indicating potential for extension to other modern architectures. Our code is freely available at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2411.16760 [cs.CV]
	(or arXiv:2411.16760v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2411.16760

Submission history

From: Faridoun Mehri [view email]
[v1] Sun, 24 Nov 2024 15:02:52 UTC (36,197 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:LibraGrad: Balancing Gradient Flow for Universally Better Vision Transformer Attributions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:LibraGrad: Balancing Gradient Flow for Universally Better Vision Transformer Attributions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators