PathVG: A New Benchmark and Dataset for Pathology Visual Grounding

Zhong, Chunlin; Hao, Shuang; Wu, Junhua; Chang, Xiaona; Jiang, Jiwei; Nie, Xiu; Tang, He; Bai, Xiang

Abstract:With the rapid development of computational pathology, many AI-assisted diagnostic tasks have emerged. Cellular nuclei segmentation can segment various types of cells for downstream analysis, but it relies on predefined categories and lacks flexibility. Moreover, pathology visual question answering can perform image-level understanding but lacks region-level detection capability. To address this, we propose a new benchmark called Pathology Visual Grounding (PathVG), which aims to detect regions based on expressions with different attributes. To evaluate PathVG, we create a new dataset named RefPath which contains 27,610 images with 33,500 language-grounded boxes. Compared to visual grounding in other domains, PathVG presents pathological images at multi-scale and contains expressions with pathological knowledge. In the experimental study, we found that the biggest challenge was the implicit information underlying the pathological expressions. Based on this, we proposed Pathology Knowledge-enhanced Network (PKNet) as the baseline model for PathVG. PKNet leverages the knowledge-enhancement capabilities of Large Language Models (LLMs) to convert pathological terms with implicit information into explicit visual features, and fuses knowledge features with expression features through the designed Knowledge Fusion Module (KFM). The proposed method achieves state-of-the-art performance on the PathVG benchmark.

Comments:	10pages, 4figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2502.20869 [cs.CV]
	(or arXiv:2502.20869v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2502.20869

Computer Science > Computer Vision and Pattern Recognition

Title:PathVG: A New Benchmark and Dataset for Pathology Visual Grounding

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators