ROI-Guided Point Cloud Geometry Compression Towards Human and Machine Vision

Liang, Xie; Wei, Gao; Ming, Zhenghui; Ge, Li

Computer Science > Computer Vision and Pattern Recognition

arXiv:2504.14240 (cs)

[Submitted on 19 Apr 2025]

Title:ROI-Guided Point Cloud Geometry Compression Towards Human and Machine Vision

Authors:Xie Liang, Gao Wei, Zhenghui Ming, Li Ge

View PDF HTML (experimental)

Abstract:Point cloud data is pivotal in applications like autonomous driving, virtual reality, and robotics. However, its substantial volume poses significant challenges in storage and transmission. In order to obtain a high compression ratio, crucial semantic details usually confront severe damage, leading to difficulties in guaranteeing the accuracy of downstream tasks. To tackle this problem, we are the first to introduce a novel Region of Interest (ROI)-guided Point Cloud Geometry Compression (RPCGC) method for human and machine vision. Our framework employs a dual-branch parallel structure, where the base layer encodes and decodes a simplified version of the point cloud, and the enhancement layer refines this by focusing on geometry details. Furthermore, the residual information of the enhancement layer undergoes refinement through an ROI prediction network. This network generates mask information, which is then incorporated into the residuals, serving as a strong supervision signal. Additionally, we intricately apply these mask details in the Rate-Distortion (RD) optimization process, with each point weighted in the distortion calculation. Our loss function includes RD loss and detection loss to better guide point cloud encoding for the machine. Experiment results demonstrate that RPCGC achieves exceptional compression performance and better detection accuracy (10% gain) than some learning-based compression methods at high bitrates in ScanNet and SUN RGB-D datasets.

Comments:	10 pages, 5 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
Cite as:	arXiv:2504.14240 [cs.CV]
	(or arXiv:2504.14240v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2504.14240
Journal reference:	ACM International Conference on Multimedia 2024

Submission history

From: Liang Xie [view email]
[v1] Sat, 19 Apr 2025 09:31:37 UTC (6,502 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:ROI-Guided Point Cloud Geometry Compression Towards Human and Machine Vision

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:ROI-Guided Point Cloud Geometry Compression Towards Human and Machine Vision

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators