HYDEN: Hyperbolic Density Representations for Medical Images and Reports

Qiao, Zhi; Han, Linbin; Zhen, Xiantong; Gao, Jia-Hong; Qian, Zhen

Computer Science > Artificial Intelligence

arXiv:2408.09715 (cs)

[Submitted on 19 Aug 2024 (v1), last revised 20 Aug 2024 (this version, v2)]

Title:HYDEN: Hyperbolic Density Representations for Medical Images and Reports

Authors:Zhi Qiao, Linbin Han, Xiantong Zhen, Jia-Hong Gao, Zhen Qian

View PDF HTML (experimental)

Abstract:In light of the inherent entailment relations between images and text, hyperbolic point vector embeddings, leveraging the hierarchical modeling advantages of hyperbolic space, have been utilized for visual semantic representation learning. However, point vector embedding approaches fail to address the issue of semantic uncertainty, where an image may have multiple interpretations, and text may refer to different images, a phenomenon particularly prevalent in the medical domain. Therefor, we propose \textbf{HYDEN}, a novel hyperbolic density embedding based image-text representation learning approach tailored for specific medical domain data. This method integrates text-aware local features alongside global features from images, mapping image-text features to density features in hyperbolic space via using hyperbolic pseudo-Gaussian distributions. An encapsulation loss function is employed to model the partial order relations between image-text density distributions. Experimental results demonstrate the interpretability of our approach and its superior performance compared to the baseline methods across various zero-shot tasks and different datasets.

Subjects:	Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
Cite as:	arXiv:2408.09715 [cs.AI]
	(or arXiv:2408.09715v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2408.09715

Submission history

From: Zhi Qiao [view email]
[v1] Mon, 19 Aug 2024 06:06:30 UTC (1,398 KB)
[v2] Tue, 20 Aug 2024 03:13:41 UTC (1,398 KB)

Computer Science > Artificial Intelligence

Title:HYDEN: Hyperbolic Density Representations for Medical Images and Reports

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:HYDEN: Hyperbolic Density Representations for Medical Images and Reports

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators