Box Supervised Video Segmentation Proposal Network

Hannan, Tanveer; Koner, Rajat; Kobold, Jonathan; Schubert, Matthias

arXiv:2202.07025 (cs)

[Submitted on 14 Feb 2022 (v1), last revised 16 Feb 2022 (this version, v2)]

Abstract: Video Object Segmentation (VOS) has been targeted by various fully-supervised and self-supervised approaches. While fully-supervised methods demonstrate excellent results, self-supervised ones, which do not use pixel-level ground truth, attract much attention. However, self-supervised approaches still suffer from a significant performance gap. Box-level annotations provide a balanced compromise between labeling effort and result quality for image segmentation but have not been exploited for the video domain. In this work, we propose a box-supervised video object segmentation proposal network, which takes advantage of intrinsic video properties. Our method incorporates object motion in two ways. First, motion is computed using a bidirectional temporal difference and a novel bounding box-guided motion compensation. Second, we introduce a novel motion-aware affinity loss that encourages the network to predict positive pixel pairs if they share similar motion and color. The proposed method outperforms the state-of-the-art self-supervised benchmark by 16.4% and 6.9% $\mathcal{J}\&\mathcal{F}$ score and the majority of fully-supervised methods on the DAVIS and YouTube-VOS datasets without imposing network architectural specifications. We provide extensive tests and ablations on the datasets, demonstrating the robustness of our method.
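
Illustrative sketch (not taken from the paper): the abstract names two ingredients, a bidirectional temporal difference and pixel pairs treated as positive when they share similar motion and color. The NumPy snippet below is one possible reading of those two ideas. The function names, the averaging of the backward and forward differences, the restriction to horizontally adjacent pixels, and the thresholds `color_tau` and `motion_tau` are all assumptions made for illustration. The bounding box-guided motion compensation and the loss itself are not specified in the abstract and are left out.

```python
import numpy as np


def bidirectional_temporal_difference(prev_frame, frame, next_frame):
    """Per-pixel motion cue for `frame`, using the frame before and after it.

    All inputs are float arrays of shape (H, W, C). Returns an (H, W) array.
    """
    backward = np.abs(frame - prev_frame).mean(axis=-1)
    forward = np.abs(next_frame - frame).mean(axis=-1)
    # Assumption: the two directions are simply averaged.
    return 0.5 * (backward + forward)


def positive_pair_mask(frame, motion, color_tau=0.1, motion_tau=0.1):
    """Flag horizontally adjacent pixel pairs that agree in color AND motion.

    Returns a boolean array of shape (H, W - 1); entry [y, x] describes the
    pair formed by pixels (y, x) and (y, x + 1).
    """
    color_gap = np.abs(frame[:, 1:] - frame[:, :-1]).max(axis=-1)
    motion_gap = np.abs(motion[:, 1:] - motion[:, :-1])
    # Hypothetical thresholds; a pair is positive only if both gaps are small.
    return (color_gap < color_tau) & (motion_gap < motion_tau)


if __name__ == "__main__":
    # Toy clip: a single bright pixel moves one step to the right.
    prev_frame = np.zeros((4, 4, 3))
    frame = np.zeros((4, 4, 3))
    next_frame = np.zeros((4, 4, 3))
    frame[1, 1] = 1.0
    next_frame[1, 2] = 1.0

    motion = bidirectional_temporal_difference(prev_frame, frame, next_frame)
    pairs = positive_pair_mask(frame, motion)
    print(motion)
    print(pairs)
```

In the toy clip, the pair of pixels at (1, 2) and (1, 3) has identical color in the middle frame but different motion, so it is not flagged as positive. This is the kind of case where a motion cue adds information that color alone does not.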

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2202.07025 [cs.CV]
	(or arXiv:2202.07025v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2202.07025

Submission history

From: Rajat Koner
[v1] Mon, 14 Feb 2022 20:38:28 UTC (36,107 KB)
[v2] Wed, 16 Feb 2022 22:09:01 UTC (38,681 KB)
