Shazam: Unifying Multiple Foundation Models for Advanced Computational Pathology

Lei, Wenhui; Li, Anqi; Tan, Yusheng; Chen, Hanyu; Zhang, Xiaofan

[Submitted on 2 Mar 2025 (v1), last revised 6 Mar 2025 (this version, v2)]

Abstract: Foundation Models (FMs) in computational pathology (CPath) have significantly advanced the extraction of meaningful features from histopathology image datasets, achieving strong performance across various clinical tasks. Despite their impressive performance, these models often exhibit variability when applied to different tasks, prompting the need for a unified framework capable of consistently excelling across various applications. In this work, we propose Shazam, a novel framework designed to efficiently combine multiple CPath models. Unlike previous approaches that train a fixed-parameter FM, Shazam dynamically extracts and refines information from diverse FMs for each specific task. To ensure that each FM contributes effectively without dominance, a novel distillation strategy is applied, guiding the student model with features from all teacher models, which enhances its generalization ability. Experimental results on two pathology patch classification datasets demonstrate that Shazam outperforms existing CPath models and other fusion methods. Its lightweight, flexible design makes it a promising solution for improving CPath analysis in real-world settings. Code will be available at this https URL.
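
The abstract describes guiding a student model with features from all teacher models so that no single FM dominates. As a rough illustration of that general idea only, the sketch below averages a per-teacher feature-matching term with equal weight. Everything here is an assumption rather than the authors' method: the abstract does not state the loss, so cosine distance is a stand-in, the function names `cosine_similarity` and `multi_teacher_distill_loss` are hypothetical, and the student and teacher features are assumed to share one dimension (real FMs would likely need a projection first).

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length, non-zero feature vectors."""
    if len(u) != len(v):
        raise ValueError("feature vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("feature vectors must be non-zero")
    return dot / (norm_u * norm_v)


def multi_teacher_distill_loss(student_feat, teacher_feats):
    """Mean of (1 - cosine similarity) between the student and each teacher.

    Assumption: every teacher gets the same weight, which is one simple way
    to keep any single teacher from dominating. The paper may weight or
    combine teachers differently.
    """
    if not teacher_feats:
        raise ValueError("at least one teacher feature vector is required")
    per_teacher = [1.0 - cosine_similarity(student_feat, t) for t in teacher_feats]
    return sum(per_teacher) / len(per_teacher)


if __name__ == "__main__":
    # Toy 2-D features: the student matches the first teacher exactly
    # and is orthogonal to the second.
    student = [1.0, 0.0]
    teachers = [[1.0, 0.0], [0.0, 1.0]]
    print(multi_teacher_distill_loss(student, teachers))
```

In this toy case the first teacher contributes a distance of 0 and the second a distance of 1, so the averaged loss is 0.5. In a training loop, a term of this kind would typically be added to the task loss for the student.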

Comments:	9 pages, 2 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2503.00736 [cs.CV]
	(or arXiv:2503.00736v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2503.00736

Submission history

From: WenHui Lei
[v1] Sun, 2 Mar 2025 05:20:41 UTC (2,466 KB)
[v2] Thu, 6 Mar 2025 03:35:09 UTC (2,466 KB)
