A Spatially-Aware Multiple Instance Learning Framework for Digital Pathology

Keshvarikhojasteh, Hassan; Tifrea, Mihail; Hess, Sibylle; Pluim, Josien P. W.; Veta, Mitko

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2504.17379 (eess)

[Submitted on 24 Apr 2025]

Title:A Spatially-Aware Multiple Instance Learning Framework for Digital Pathology

Authors:Hassan Keshvarikhojasteh, Mihail Tifrea, Sibylle Hess, Josien P.W. Pluim, Mitko Veta

View PDF HTML (experimental)

Abstract:Multiple instance learning (MIL) is a promising approach for weakly supervised classification in pathology using whole slide images (WSIs). However, conventional MIL methods such as Attention-Based Deep Multiple Instance Learning (ABMIL) typically disregard spatial interactions among patches that are crucial to pathological diagnosis. Recent advancements, such as Transformer based MIL (TransMIL), have incorporated spatial context and inter-patch relationships. However, it remains unclear whether explicitly modeling patch relationships yields similar performance gains in ABMIL, which relies solely on Multi-Layer Perceptrons (MLPs). In contrast, TransMIL employs Transformer-based layers, introducing a fundamental architectural shift at the cost of substantially increased computational complexity. In this work, we enhance the ABMIL framework by integrating interaction-aware representations to address this question. Our proposed model, Global ABMIL (GABMIL), explicitly captures inter-instance dependencies while preserving computational efficiency. Experimental results on two publicly available datasets for tumor subtyping in breast and lung cancers demonstrate that GABMIL achieves up to a 7 percentage point improvement in AUPRC and a 5 percentage point increase in the Kappa score over ABMIL, with minimal or no additional computational overhead. These findings underscore the importance of incorporating patch interactions within MIL frameworks.

Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2504.17379 [eess.IV]
	(or arXiv:2504.17379v1 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2504.17379

Submission history

From: Hassan Keshvarikhojasteh [view email]
[v1] Thu, 24 Apr 2025 08:53:46 UTC (9,436 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:A Spatially-Aware Multiple Instance Learning Framework for Digital Pathology

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:A Spatially-Aware Multiple Instance Learning Framework for Digital Pathology

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators