Semantics through Time: Semi-supervised Segmentation of Aerial Videos with Iterative Label Propagation

Marcu, Alina; Licaret, Vlad; Costea, Dragos; Leordeanu, Marius

Computer Science > Computer Vision and Pattern Recognition

arXiv:2010.01910 (cs)

[Submitted on 2 Oct 2020]

Title:Semantics through Time: Semi-supervised Segmentation of Aerial Videos with Iterative Label Propagation

Authors:Alina Marcu, Vlad Licaret, Dragos Costea, Marius Leordeanu

View PDF

Abstract:Semantic segmentation is a crucial task for robot navigation and safety. However, current supervised methods require a large amount of pixelwise annotations to yield accurate results. Labeling is a tedious and time consuming process that has hampered progress in low altitude UAV applications. This paper makes an important step towards automatic annotation by introducing SegProp, a novel iterative flow-based method, with a direct connection to spectral clustering in space and time, to propagate the semantic labels to frames that lack human annotations. The labels are further used in semi-supervised learning scenarios. Motivated by the lack of a large video aerial dataset, we also introduce Ruralscapes, a new dataset with high resolution (4K) images and manually-annotated dense labels every 50 frames - the largest of its kind, to the best of our knowledge. Our novel SegProp automatically annotates the remaining unlabeled 98% of frames with an accuracy exceeding 90% (F-measure), significantly outperforming other state-of-the-art label propagation methods. Moreover, when integrating other methods as modules inside SegProp's iterative label propagation loop, we achieve a significant boost over the baseline labels. Finally, we test SegProp in a full semi-supervised setting: we train several state-of-the-art deep neural networks on the SegProp-automatically-labeled training frames and test them on completely novel videos. We convincingly demonstrate, every time, a significant improvement over the supervised scenario.

Comments:	Accepted as oral presentation at Asian Conference on Computer Vision (ACCV), 2020. arXiv admin note: text overlap with arXiv:1910.10026
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2010.01910 [cs.CV]
	(or arXiv:2010.01910v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2010.01910

Submission history

From: Alina Marcu M.Sc [view email]
[v1] Fri, 2 Oct 2020 15:15:50 UTC (11,964 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Semantics through Time: Semi-supervised Segmentation of Aerial Videos with Iterative Label Propagation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Semantics through Time: Semi-supervised Segmentation of Aerial Videos with Iterative Label Propagation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators