Exploring Context with Deep Structured models for Semantic Segmentation

Lin, Guosheng; Shen, Chunhua; Hengel, Anton van den; Reid, Ian

Computer Science > Computer Vision and Pattern Recognition

arXiv:1603.03183 (cs)

[Submitted on 10 Mar 2016 (v1), last revised 2 May 2017 (this version, v3)]

Title:Exploring Context with Deep Structured models for Semantic Segmentation

Authors:Guosheng Lin, Chunhua Shen, Anton van den Hengel, Ian Reid

View PDF

Abstract:State-of-the-art semantic image segmentation methods are mostly based on training deep convolutional neural networks (CNNs). In this work, we proffer to improve semantic segmentation with the use of contextual information. In particular, we explore `patch-patch' context and `patch-background' context in deep CNNs. We formulate deep structured models by combining CNNs and Conditional Random Fields (CRFs) for learning the patch-patch context between image regions. Specifically, we formulate CNN-based pairwise potential functions to capture semantic correlations between neighboring patches. Efficient piecewise training of the proposed deep structured model is then applied in order to avoid repeated expensive CRF inference during the course of back propagation. For capturing the patch-background context, we show that a network design with traditional multi-scale image inputs and sliding pyramid pooling is very effective for improving performance. We perform comprehensive evaluation of the proposed method. We achieve new state-of-the-art performance on a number of challenging semantic segmentation datasets including $NYUDv2$, $PASCAL$-$VOC2012$, $Cityscapes$, $PASCAL$-$Context$, $SUN$-$RGBD$, $SIFT$-$flow$, and $KITTI$ datasets. Particularly, we report an intersection-over-union score of $77.8$ on the $PASCAL$-$VOC2012$ dataset.

Comments:	16 pages. Accepted to IEEE T. Pattern Analysis & Machine Intelligence, 2017. Extended version of arXiv:1504.01013
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1603.03183 [cs.CV]
	(or arXiv:1603.03183v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1603.03183

Submission history

From: Chunhua Shen [view email]
[v1] Thu, 10 Mar 2016 08:34:19 UTC (8,561 KB)
[v2] Sat, 26 Mar 2016 12:24:30 UTC (8,561 KB)
[v3] Tue, 2 May 2017 08:06:42 UTC (5,145 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Exploring Context with Deep Structured models for Semantic Segmentation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Exploring Context with Deep Structured models for Semantic Segmentation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators