CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment

Javed, Sajid; Mahmood, Arif; Ganapathi, Iyyakutti Iyappan; Dharejo, Fayaz Ali; Werghi, Naoufel; Bennamoun, Mohammed

Computer Science > Computer Vision and Pattern Recognition

arXiv:2406.05205 (cs)

[Submitted on 7 Jun 2024]

Title:CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment

Authors:Sajid Javed, Arif Mahmood, Iyyakutti Iyappan Ganapathi, Fayaz Ali Dharejo, Naoufel Werghi, Mohammed Bennamoun

View PDF HTML (experimental)

Abstract:This paper proposes Comprehensive Pathology Language Image Pre-training (CPLIP), a new unsupervised technique designed to enhance the alignment of images and text in histopathology for tasks such as classification and segmentation. This methodology enriches vision-language models by leveraging extensive data without needing ground truth annotations. CPLIP involves constructing a pathology-specific dictionary, generating textual descriptions for images using language models, and retrieving relevant images for each text snippet via a pre-trained model. The model is then fine-tuned using a many-to-many contrastive learning method to align complex interrelated concepts across both modalities. Evaluated across multiple histopathology tasks, CPLIP shows notable improvements in zero-shot learning scenarios, outperforming existing methods in both interpretability and robustness and setting a higher benchmark for the application of vision-language models in the field. To encourage further research and replication, the code for CPLIP is available on GitHub at this https URL

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM); Image and Video Processing (eess.IV)
Cite as:	arXiv:2406.05205 [cs.CV]
	(or arXiv:2406.05205v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2406.05205

Submission history

From: Arif Mahmood [view email]
[v1] Fri, 7 Jun 2024 18:39:58 UTC (20,250 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators