Unsupervised Foundation Model-Agnostic Slide-Level Representation Learning

Lenz, Tim; Neidlinger, Peter; Ligero, Marta; Wölflein, Georg; van Treeck, Marko; Kather, Jakob Nikolas

Computer Science > Computer Vision and Pattern Recognition

arXiv:2411.13623 (cs)

[Submitted on 20 Nov 2024]

Abstract: Representation learning of pathology whole-slide images (WSIs) has primarily relied on weak supervision with Multiple Instance Learning (MIL). This approach leads to slide representations that are highly tailored to a specific clinical task. Self-supervised learning (SSL) has been successfully applied to train histopathology foundation models (FMs) for patch embedding generation. However, generating patient- or slide-level embeddings remains challenging. Existing approaches for slide representation learning extend the principles of SSL from patch-level learning to entire slides by aligning different augmentations of the slide or by utilizing multimodal data. By integrating tile embeddings from multiple FMs, we propose a new single-modality SSL method in feature space that generates useful slide representations. Our contrastive pretraining strategy, called COBRA, employs multiple FMs and an architecture based on Mamba-2. COBRA exceeds the performance of state-of-the-art slide encoders on four different public CPTAC cohorts on average by at least +3.8% AUC, despite being pretrained on only 3048 WSIs from TCGA. Additionally, COBRA is readily compatible at inference time with previously unseen feature extractors.
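
The idea of contrastive pretraining in feature space can be illustrated with a minimal sketch. This is not the COBRA implementation: the abstract does not specify the loss, so a generic InfoNCE objective is assumed here, mean pooling stands in for the Mamba-2-based aggregator, and the tile embeddings from the two feature extractors are assumed to be already projected to a shared dimension. The names `mean_pool`, `l2_normalize`, and `info_nce`, the temperature value, and the toy data are all hypothetical.

```python
import math


def mean_pool(tiles):
    """Average equal-length tile embeddings into one slide vector.

    Assumption: a simple stand-in for a learned slide aggregator.
    """
    n = len(tiles)
    return [sum(column) / n for column in zip(*tiles)]


def l2_normalize(vector):
    """Scale a vector to unit length (left unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in vector)) or 1.0
    return [x / norm for x in vector]


def info_nce(views_a, views_b, temperature=0.2):
    """Generic InfoNCE loss over a batch of slides.

    views_a[i] and views_b[i] are two views of slide i (here: slide
    vectors built from two different feature extractors). They form the
    positive pair; all other slides in the batch act as negatives.
    """
    a = [l2_normalize(v) for v in views_a]
    b = [l2_normalize(v) for v in views_b]
    loss = 0.0
    for i, anchor in enumerate(a):
        # Cosine similarity of the anchor to every candidate, scaled.
        logits = [
            sum(x * y for x, y in zip(anchor, other)) / temperature
            for other in b
        ]
        # Numerically stable log-sum-exp for the softmax denominator.
        peak = max(logits)
        log_denominator = peak + math.log(
            sum(math.exp(value - peak) for value in logits)
        )
        loss += log_denominator - logits[i]
    return loss / len(a)


# Toy data (hypothetical): two slides, two tiles each, as embedded by two
# extractors and already mapped to a shared 2-dimensional space.
slide_tiles_fm_a = [[[1.0, 0.0], [0.9, 0.1]], [[0.0, 1.0], [0.1, 0.9]]]
slide_tiles_fm_b = [[[0.8, 0.2], [1.0, 0.0]], [[0.2, 0.8], [0.0, 1.0]]]

views_a = [mean_pool(tiles) for tiles in slide_tiles_fm_a]
views_b = [mean_pool(tiles) for tiles in slide_tiles_fm_b]

aligned_loss = info_nce(views_a, views_b)          # matching slides paired
shuffled_loss = info_nce(views_a, views_b[::-1])   # slides deliberately mismatched
```

In this toy setup the loss is lower when each slide is paired with its own counterpart from the other extractor than when the pairs are swapped, which is the signal a contrastive objective of this kind optimizes.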

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2411.13623 [cs.CV]
	(or arXiv:2411.13623v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2411.13623

Submission history

From: Tim Lenz
[v1] Wed, 20 Nov 2024 13:12:43 UTC (2,348 KB)
