Hibou: A Family of Foundational Vision Transformers for Pathology

Nechaev, Dmitry; Pchelnikov, Alexey; Ivanova, Ekaterina

arXiv:2406.05074 (eess)

[Submitted on 7 Jun 2024 (v1), last revised 20 Aug 2024 (this version, v2)]

Abstract: Pathology, the microscopic examination of diseased tissue, is critical for diagnosing various medical conditions, particularly cancers. Traditional methods are labor-intensive and prone to human error. Digital pathology, which converts glass slides into high-resolution digital images for analysis by computer algorithms, revolutionizes the field by enhancing diagnostic accuracy, consistency, and efficiency through automated image analysis and large-scale data processing. Foundational transformer pretraining is crucial for developing robust, generalizable models as it enables learning from vast amounts of unannotated data.
This paper introduces the Hibou family of foundational vision transformers for pathology, leveraging the DINOv2 framework to pretrain two model variants, Hibou-B and Hibou-L, on a proprietary dataset of over 1 million whole slide images (WSIs) representing diverse tissue types and staining techniques. Our pretrained models demonstrate superior performance on both patch-level and slide-level benchmarks, surpassing existing state-of-the-art methods. Notably, Hibou-L achieves the highest average accuracy across multiple benchmark datasets. To support further research and application in the field, we have open-sourced the Hibou models, which can be accessed at this https URL.

Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2406.05074 [eess.IV]
	(or arXiv:2406.05074v2 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2406.05074

Submission history

From: Dmitry Nechaev
[v1] Fri, 7 Jun 2024 16:45:53 UTC (110 KB)
[v2] Tue, 20 Aug 2024 11:01:21 UTC (113 KB)
