Do Histopathological Foundation Models Eliminate Batch Effects? A Comparative Study

Kömen, Jonah; Marienwald, Hannah; Dippel, Jonas; Hense, Julius

Computer Science > Machine Learning

arXiv:2411.05489 (cs)

[Submitted on 8 Nov 2024]

Title:Do Histopathological Foundation Models Eliminate Batch Effects? A Comparative Study

Authors:Jonah Kömen, Hannah Marienwald, Jonas Dippel, Julius Hense

View PDF HTML (experimental)

Abstract:Deep learning has led to remarkable advancements in computational histopathology, e.g., in diagnostics, biomarker prediction, and outcome prognosis. Yet, the lack of annotated data and the impact of batch effects, e.g., systematic technical data differences across hospitals, hamper model robustness and generalization. Recent histopathological foundation models -- pretrained on millions to billions of images -- have been reported to improve generalization performances on various downstream tasks. However, it has not been systematically assessed whether they fully eliminate batch effects. In this study, we empirically show that the feature embeddings of the foundation models still contain distinct hospital signatures that can lead to biased predictions and misclassifications. We further find that the signatures are not removed by stain normalization methods, dominate distances in feature space, and are evident across various principal components. Our work provides a novel perspective on the evaluation of medical foundation models, paving the way for more robust pretraining strategies and downstream predictors.

Comments:	Accepted to AIM-FM Workshop @ NeurIPS'24
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2411.05489 [cs.LG]
	(or arXiv:2411.05489v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2411.05489

Submission history

From: Julius Hense [view email]
[v1] Fri, 8 Nov 2024 11:39:03 UTC (5,215 KB)

Computer Science > Machine Learning

Title:Do Histopathological Foundation Models Eliminate Batch Effects? A Comparative Study

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Do Histopathological Foundation Models Eliminate Batch Effects? A Comparative Study

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators