Benchmarking Robustness of Contrastive Learning Models for Medical Image-Report Retrieval

Deanda, Demetrio; Masupalli, Yuktha Priya; Yang, Jeong; Lee, Young; Cao, Zechun; Liang, Gongbo

Computer Science > Computer Vision and Pattern Recognition

arXiv:2501.09134 (cs)

[Submitted on 15 Jan 2025]

Title:Benchmarking Robustness of Contrastive Learning Models for Medical Image-Report Retrieval

Authors:Demetrio Deanda, Yuktha Priya Masupalli, Jeong Yang, Young Lee, Zechun Cao, Gongbo Liang

View PDF HTML (experimental)

Abstract:Medical images and reports offer invaluable insights into patient health. The heterogeneity and complexity of these data hinder effective analysis. To bridge this gap, we investigate contrastive learning models for cross-domain retrieval, which associates medical images with their corresponding clinical reports. This study benchmarks the robustness of four state-of-the-art contrastive learning models: CLIP, CXR-RePaiR, MedCLIP, and CXR-CLIP. We introduce an occlusion retrieval task to evaluate model performance under varying levels of image corruption. Our findings reveal that all evaluated models are highly sensitive to out-of-distribution data, as evidenced by the proportional decrease in performance with increasing occlusion levels. While MedCLIP exhibits slightly more robustness, its overall performance remains significantly behind CXR-CLIP and CXR-RePaiR. CLIP, trained on a general-purpose dataset, struggles with medical image-report retrieval, highlighting the importance of domain-specific training data. The evaluation of this work suggests that more effort needs to be spent on improving the robustness of these models. By addressing these limitations, we can develop more reliable cross-domain retrieval models for medical applications.

Comments:	This work is accepted to AAAI 2025 Workshop -- the 9th International Workshop on Health Intelligence
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as:	arXiv:2501.09134 [cs.CV]
	(or arXiv:2501.09134v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2501.09134

Submission history

From: Gongbo Liang [view email]
[v1] Wed, 15 Jan 2025 20:37:04 UTC (3,507 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Benchmarking Robustness of Contrastive Learning Models for Medical Image-Report Retrieval

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Benchmarking Robustness of Contrastive Learning Models for Medical Image-Report Retrieval

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators