AEM: Attention Entropy Maximization for Multiple Instance Learning based Whole Slide Image Classification

Zhang, Yunlong; Shui, Zhongyi; Sun, Yunxuan; Li, Honglin; Li, Jingxiong; Zhu, Chenglu; Yang, Lin

Computer Science > Computer Vision and Pattern Recognition

arXiv:2406.15303 (cs)

[Submitted on 18 Jun 2024 (v1), last revised 18 Aug 2024 (this version, v2)]

Title:AEM: Attention Entropy Maximization for Multiple Instance Learning based Whole Slide Image Classification

Authors:Yunlong Zhang, Zhongyi Shui, Yunxuan Sun, Honglin Li, Jingxiong Li, Chenglu Zhu, Lin Yang

View PDF HTML (experimental)

Abstract:Multiple Instance Learning (MIL) has demonstrated effectiveness in analyzing whole slide images (WSIs), yet it often encounters overfitting challenges in real-world applications, particularly in the form of attention over-concentration. While existing methods to alleviate this issue introduce complex modules or processing steps, such as multiple-stage training and teacher-student distillation, this paper proposes a simple yet effective regularization: Attention Entropy Maximization (AEM). Motivated by our investigation revealing a positive correlation between attention entropy and model performance, AEM incorporates a negative entropy loss for attention values into the standard MIL framework, penalizing overly concentrated attention and encouraging the model to consider a broader range of informative regions in WSIs, potentially improving its generalization capabilities. Compared to existing overfitting mitigation methods, our AEM approach offers advantages of simplicity, efficiency, and versatility. It requires no additional modules or processing steps, involves only one hyperparameter, and demonstrates compatibility with MIL frameworks and techniques. These advantages make AEM particularly attractive for practical applications. We evaluate AEM on three benchmark datasets, demonstrating consistent performance improvements over existing methods. Furthermore, AEM shows high versatility, integrating effectively with four feature extractors, two advanced MIL frameworks, three attention mechanisms, and Subsampling augmentation technique. The source code is available at \url{this https URL}.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2406.15303 [cs.CV]
	(or arXiv:2406.15303v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2406.15303

Submission history

From: Yunlong Zhang [view email]
[v1] Tue, 18 Jun 2024 02:01:17 UTC (74,607 KB)
[v2] Sun, 18 Aug 2024 02:48:32 UTC (23,486 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:AEM: Attention Entropy Maximization for Multiple Instance Learning based Whole Slide Image Classification

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:AEM: Attention Entropy Maximization for Multiple Instance Learning based Whole Slide Image Classification

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators