Virchow: A Million-Slide Digital Pathology Foundation Model

Vorontsov, Eugene; Bozkurt, Alican; Casson, Adam; Shaikovski, George; Zelechowski, Michal; Liu, Siqi; Severson, Kristen; Zimmermann, Eric; Hall, James; Tenenholtz, Neil; Fusi, Nicolo; Mathieu, Philippe; van Eck, Alexander; Lee, Donghun; Viret, Julian; Robert, Eric; Wang, Yi Kan; Kunz, Jeremy D.; Lee, Matthew C. H.; Bernhard, Jan; Godrich, Ran A.; Oakley, Gerard; Millar, Ewan; Hanna, Matthew; Retamero, Juan; Moye, William A.; Yousfi, Razik; Kanan, Christopher; Klimstra, David; Rothrock, Brandon; Fuchs, Thomas J.

arXiv:2309.07778 (eess)

[Submitted on 14 Sep 2023 (v1), last revised 18 Jan 2024 (this version, v5)]

Title: Virchow: A Million-Slide Digital Pathology Foundation Model

Abstract: The use of artificial intelligence to enable precision medicine and decision support systems through the analysis of pathology images has the potential to revolutionize the diagnosis and treatment of cancer. Such applications will depend on models' abilities to capture the diverse patterns observed in pathology images. To address this challenge, we present Virchow, a foundation model for computational pathology. Virchow is a vision transformer model with 632 million parameters, trained with self-supervised learning using the DINOv2 algorithm on 1.5 million hematoxylin and eosin stained whole slide images from diverse tissue and specimen types, which is orders of magnitude more data than previous works. The Virchow model enables the development of a pan-cancer detection system with 0.949 overall specimen-level AUC across 17 different cancer types, while also achieving 0.937 AUC on 7 rare cancer types. The Virchow model sets the state of the art on the internal and external image tile-level benchmarks and slide-level biomarker prediction tasks. The gains in performance highlight the importance of training on massive pathology image datasets, suggesting that scaling up the data and network architecture can improve the accuracy for many high-impact computational pathology applications where limited amounts of training data are available.
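
The abstract reports detection performance as specimen-level AUC. As a minimal sketch of what that metric measures, the snippet below computes AUC from its standard pairwise definition: the probability that a randomly chosen positive specimen receives a higher score than a randomly chosen negative one, with ties counted as one half. The function name `roc_auc` and the toy labels and scores are hypothetical illustrations; they are not the paper's data, and the abstract does not describe the authors' evaluation code.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve via the pairwise (Mann-Whitney) definition.

    labels: 1 for a cancer-positive specimen, 0 for a negative one.
    scores: model score per specimen (higher means more likely positive).
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative specimen")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half a win
    return wins / (len(pos) * len(neg))


# Hypothetical toy data, for illustration only.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.1]
auc = roc_auc(labels, scores)
print(f"AUC = {auc:.3f}")
```

The quadratic loop is kept for readability; a rank-based implementation would be preferable for large specimen counts.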

Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Tissues and Organs (q-bio.TO)
Cite as:	arXiv:2309.07778 [eess.IV]
	(or arXiv:2309.07778v5 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2309.07778

Submission history

From: Siqi Liu
[v1] Thu, 14 Sep 2023 15:09:35 UTC (12,939 KB)
[v2] Fri, 15 Sep 2023 12:32:35 UTC (12,939 KB)
[v3] Thu, 21 Sep 2023 22:08:26 UTC (12,939 KB)
[v4] Sat, 28 Oct 2023 00:01:45 UTC (12,939 KB)
[v5] Thu, 18 Jan 2024 03:55:30 UTC (15,181 KB)
