ObitoNet: Multimodal High-Resolution Point Cloud Reconstruction

Thapliyal, Apoorv; Lanka, Vinay; Baskaran, Swathi


arXiv:2412.18775 (cs)

[Submitted on 25 Dec 2024]


Abstract: ObitoNet employs a cross-attention mechanism to integrate multimodal inputs: Vision Transformers (ViT) extract semantic features from images, while a point cloud tokenizer processes geometric information, using Farthest Point Sampling (FPS) and K-Nearest Neighbors (KNN) to capture spatial structure. The learned multimodal features are fed into a transformer-based decoder for high-resolution point cloud reconstruction. This approach leverages the complementary strengths of both modalities (rich image features and precise geometric details), ensuring robust point cloud generation even in challenging conditions such as sparse or noisy data.
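
The abstract names FPS and KNN as the building blocks of the point cloud tokenizer but gives no implementation details. As a rough illustration only, the sketch below shows one common way these two steps are combined to turn a point cloud into local patches: FPS picks well-spread center points, and KNN gathers the nearest neighbors of each center. It assumes NumPy; the function names `farthest_point_sampling` and `knn_group`, the group count, and the neighborhood size are hypothetical choices for this example and are not taken from the paper.

```python
import numpy as np


def farthest_point_sampling(points, num_centers, start_index=0):
    """Return indices of num_centers points chosen greedily so that each
    new center is the point farthest from all centers chosen so far."""
    n = points.shape[0]
    chosen = np.empty(num_centers, dtype=np.int64)
    chosen[0] = start_index
    # Squared distance from every point to its nearest chosen center.
    min_sq_dist = np.full(n, np.inf)
    for i in range(1, num_centers):
        diff = points - points[chosen[i - 1]]
        min_sq_dist = np.minimum(min_sq_dist, np.einsum("ij,ij->i", diff, diff))
        chosen[i] = int(np.argmax(min_sq_dist))
    return chosen


def knn_group(points, center_indices, k):
    """For each center, gather its k nearest points (the center included)
    and express them relative to the center."""
    centers = points[center_indices]                      # (G, 3)
    diff = centers[:, None, :] - points[None, :, :]       # (G, N, 3)
    sq_dist = np.einsum("gnd,gnd->gn", diff, diff)        # (G, N)
    neighbor_idx = np.argsort(sq_dist, axis=1)[:, :k]     # (G, k)
    patches = points[neighbor_idx] - centers[:, None, :]  # (G, k, 3)
    return neighbor_idx, patches


# Hypothetical sizes: 256 random points, 16 groups of 8 neighbors each.
rng = np.random.default_rng(0)
cloud = rng.random((256, 3))
center_idx = farthest_point_sampling(cloud, num_centers=16)
neighbor_idx, patches = knn_group(cloud, center_idx, k=8)
print(center_idx.shape, patches.shape)  # (16,) (16, 8, 3)
```

In a tokenizer of this kind, each `(k, 3)` patch would typically be passed through a small encoder to produce one token per group; that step, like the cross-attention fusion with the image features, is not shown here because the abstract does not specify it.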

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2412.18775 [cs.CV]
	(or arXiv:2412.18775v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2412.18775

Submission history

From: Apoorv Thapliyal
[v1] Wed, 25 Dec 2024 04:34:22 UTC (2,189 KB)
