Adversarial Dual-Student with Differentiable Spatial Warping for Semi-Supervised Semantic Segmentation

Cao, Cong; Lin, Tianwei; He, Dongliang; Li, Fu; Yue, Huanjing; Yang, Jingyu; Ding, Errui

Computer Science > Computer Vision and Pattern Recognition

arXiv:2203.02792 (cs)

[Submitted on 5 Mar 2022 (v1), last revised 27 Sep 2022 (this version, v3)]

Title:Adversarial Dual-Student with Differentiable Spatial Warping for Semi-Supervised Semantic Segmentation

Authors:Cong Cao, Tianwei Lin, Dongliang He, Fu Li, Huanjing Yue, Jingyu Yang, Errui Ding

View PDF

Abstract:A common challenge posed to robust semantic segmentation is the expensive data annotation cost. Existing semi-supervised solutions show great potential for solving this problem. Their key idea is constructing consistency regularization with unsupervised data augmentation from unlabeled data for model training. The perturbations for unlabeled data enable the consistency training loss, which benefits semi-supervised semantic segmentation. However, these perturbations destroy image context and introduce unnatural boundaries, which is harmful for semantic segmentation. Besides, the widely adopted semi-supervised learning framework, i.e. mean-teacher, suffers performance limitation since the student model finally converges to the teacher model. In this paper, first of all, we propose a context friendly differentiable geometric warping to conduct unsupervised data augmentation; secondly, a novel adversarial dual-student framework is proposed to improve the Mean-Teacher from the following two aspects: (1) dual student models are learned independently except for a stabilization constraint to encourage exploiting model diversities; (2) adversarial training scheme is applied to both students and the discriminators are resorted to distinguish reliable pseudo-label of unlabeled data for self-training. Effectiveness is validated via extensive experiments on PASCAL VOC2012 and Cityscapes. Our solution significantly improves the performance and state-of-the-art results are achieved on both datasets. Remarkably, compared with fully supervision, our solution achieves comparable mIoU of 73.4% using only 12.5% annotated data on PASCAL VOC2012. Our codes and models are available at this https URL.

Comments:	Accepted by IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2203.02792 [cs.CV]
	(or arXiv:2203.02792v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2203.02792

Submission history

From: Cong Cao [view email]
[v1] Sat, 5 Mar 2022 17:36:17 UTC (3,342 KB)
[v2] Sat, 24 Sep 2022 04:56:51 UTC (3,900 KB)
[v3] Tue, 27 Sep 2022 09:36:56 UTC (3,887 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Adversarial Dual-Student with Differentiable Spatial Warping for Semi-Supervised Semantic Segmentation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Adversarial Dual-Student with Differentiable Spatial Warping for Semi-Supervised Semantic Segmentation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators