Segment Anything in Light Fields for Real-Time Applications via Constrained Prompting

Goncharov, Nikolai; Dansereau, Donald G.

arXiv:2411.13840 (cs)

[Submitted on 21 Nov 2024]

Abstract: Segmented light field images can serve as a powerful representation in many computer vision tasks that exploit the geometry and appearance of objects, such as object pose tracking. In the light field domain, segmentation carries the additional objective of recognizing the same segment across all views. Segment Anything Model 2 (SAM 2) produces semantically meaningful segments for monocular images and videos. However, applying SAM 2 directly to light fields is highly ineffective because it leaves the light field constraints unexploited. In this work, we present a novel light field segmentation method that adapts SAM 2 to the light field domain without retraining or modifying the model. By utilizing the light field domain constraints, the method produces high-quality, view-consistent light field masks, outperforming the SAM 2 video tracking baseline while running 7 times faster, at real-time speed. We achieve this by exploiting epipolar geometry cues to propagate the masks between views, probing the SAM 2 latent space to estimate their occlusion, and further prompting SAM 2 to refine them.
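
The abstract does not spell out how masks are propagated between views, so the following is only an illustrative sketch of the general epipolar idea, not the authors' method. It assumes a regularly sampled light field in which a scene point with disparity d (pixels per view step) shifts by d times the view offset, and it assigns one constant disparity to the whole segment. The function name `propagate_mask`, its parameters, and the sign convention of the shift are all hypothetical.

```python
import numpy as np

def propagate_mask(mask, disparity, du, dv):
    """Shift a boolean central-view mask into the view offset by (du, dv).

    Assumption: the whole segment shares one disparity, in pixels per
    view step, and pixels move by +disparity * offset (sign is a convention).
    """
    h, w = mask.shape
    ys, xs = np.nonzero(mask)                      # pixel coordinates of the segment
    xt = np.rint(xs + disparity * du).astype(int)  # shifted column indices
    yt = np.rint(ys + disparity * dv).astype(int)  # shifted row indices
    keep = (xt >= 0) & (xt < w) & (yt >= 0) & (yt < h)  # drop pixels leaving the frame
    out = np.zeros_like(mask)
    out[yt[keep], xt[keep]] = True
    return out

# Usage: one masked pixel at row 2, column 2, moved one view right and one view up.
mask = np.zeros((5, 5), dtype=bool)
mask[2, 2] = True
shifted = propagate_mask(mask, disparity=1.0, du=1, dv=-1)
print(np.argwhere(shifted))  # [[1 3]]
```

A shift like this cannot account for occlusion or per-pixel depth variation, which is consistent with the abstract's description of a separate occlusion estimate and a further SAM 2 refinement prompt.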

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2411.13840 [cs.CV]
	(or arXiv:2411.13840v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2411.13840

Submission history

From: Nikolai Goncharov
[v1] Thu, 21 Nov 2024 05:01:49 UTC (18,149 KB)
