Hyperbolic Contrastive Learning for Hierarchical 3D Point Cloud Embedding

Liu, Yingjie; Zhang, Pengyu; He, Ziyao; Chen, Mingsong; Tang, Xuan; Wei, Xian

arXiv:2501.02285 (cs)

[Submitted on 4 Jan 2025 (v1), last revised 7 Jan 2025 (this version, v2)]

Abstract: Hyperbolic spaces allow for more efficient modeling of complex, hierarchical structures, which is particularly beneficial in tasks involving multi-modal data. Although hyperbolic geometries have proven effective for language-image pre-training, their ability to unify language, image, and 3D Point Cloud modalities is under-explored. We extend hyperbolic multi-modal contrastive pre-training to the 3D Point Cloud modality. Additionally, we explore entailment, modality gap, and alignment regularizers for learning hierarchical 3D embeddings and for facilitating the transfer of knowledge from both the Text and Image modalities. These regularizers enable the learning of an intra-modal hierarchy within each modality and an inter-modal hierarchy across text, 2D images, and 3D Point Clouds. Experimental results demonstrate that our proposed training strategy yields an outstanding 3D Point Cloud encoder, and that the resulting hierarchical 3D Point Cloud embeddings significantly improve performance on various downstream tasks.
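
As a rough illustration of what contrastive learning in hyperbolic space can look like, the sketch below lifts Euclidean encoder outputs onto a hyperboloid and scores pairs by negative geodesic distance. The abstract does not say which model of hyperbolic space the authors use, so the Lorentz (hyperboloid) model, the curvature parameter `c`, the `temperature` value, and all function names here are assumptions made for illustration; the entailment, modality gap, and alignment regularizers are not covered.

```python
import math

def exp_map_origin(v, c=1.0):
    """Lift a Euclidean vector v (a tangent vector at the hyperboloid origin)
    onto the Lorentz model with curvature -c. Returns (time, space)."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        return 1.0 / math.sqrt(c), [0.0] * len(v)
    scale = math.sinh(math.sqrt(c) * norm) / (math.sqrt(c) * norm)
    space = [scale * x for x in v]
    # The time component follows from the constraint <x, x>_L = -1/c.
    time = math.sqrt(1.0 / c + sum(x * x for x in space))
    return time, space

def lorentz_distance(x, y, c=1.0):
    """Geodesic distance between two hyperboloid points given as (time, space)."""
    (xt, xs), (yt, ys) = x, y
    inner = sum(a * b for a, b in zip(xs, ys)) - xt * yt  # Lorentzian inner product
    # Clamp to 1 so rounding error cannot push acosh outside its domain.
    return math.acosh(max(1.0, -c * inner)) / math.sqrt(c)

def contrastive_loss(anchors, targets, temperature=0.1, c=1.0):
    """One-directional InfoNCE loss: anchors[i] should be closest to targets[i].
    Similarity is the negative hyperbolic distance (an assumed choice)."""
    n = len(anchors)
    total = 0.0
    for i in range(n):
        logits = [-lorentz_distance(anchors[i], targets[j], c) / temperature
                  for j in range(n)]
        m = max(logits)  # subtract the max for a stable log-sum-exp
        log_z = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_z - logits[i]
    return total / n

# Hypothetical 2-D encoder outputs for two point clouds and their paired captions.
cloud = [exp_map_origin([0.3, 0.4]), exp_map_origin([-0.5, 0.2])]
text = [exp_map_origin([0.3, 0.4]), exp_map_origin([-0.5, 0.2])]
loss_matched = contrastive_loss(cloud, text)
loss_swapped = contrastive_loss(cloud, text[::-1])
```

With correctly paired embeddings the loss is lower than with the pairs swapped. A symmetric variant would average this loss over both directions (point cloud to text and text to point cloud).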

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2501.02285 [cs.CV]
	(or arXiv:2501.02285v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2501.02285

Submission history

From: Yingjie Liu
[v1] Sat, 4 Jan 2025 13:27:18 UTC (1,893 KB)
[v2] Tue, 7 Jan 2025 13:38:34 UTC (1,893 KB)
