PiCANet: Learning Pixel-wise Contextual Attention for Saliency Detection

Liu, Nian; Han, Junwei; Yang, Ming-Hsuan

Computer Science > Computer Vision and Pattern Recognition

arXiv:1708.06433v2 (cs)

[Submitted on 21 Aug 2017 (v1), last revised 3 Apr 2018 (this version, v2)]

Title:PiCANet: Learning Pixel-wise Contextual Attention for Saliency Detection

Authors:Nian Liu, Junwei Han, Ming-Hsuan Yang

View PDF

Abstract:Contexts play an important role in the saliency detection task. However, given a context region, not all contextual information is helpful for the final task. In this paper, we propose a novel pixel-wise contextual attention network, i.e., the PiCANet, to learn to selectively attend to informative context locations for each pixel. Specifically, for each pixel, it can generate an attention map in which each attention weight corresponds to the contextual relevance at each context location. An attended contextual feature can then be constructed by selectively aggregating the contextual information. We formulate the proposed PiCANet in both global and local forms to attend to global and local contexts, respectively. Both models are fully differentiable and can be embedded into CNNs for joint training. We also incorporate the proposed models with the U-Net architecture to detect salient objects. Extensive experiments show that the proposed PiCANets can consistently improve saliency detection performance. The global and local PiCANets facilitate learning global contrast and homogeneousness, respectively. As a result, our saliency model can detect salient objects more accurately and uniformly, thus performing favorably against the state-of-the-art methods.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1708.06433 [cs.CV]
	(or arXiv:1708.06433v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1708.06433

Submission history

From: Nian Liu [view email]
[v1] Mon, 21 Aug 2017 22:12:45 UTC (1,027 KB)
[v2] Tue, 3 Apr 2018 09:50:03 UTC (3,260 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:PiCANet: Learning Pixel-wise Contextual Attention for Saliency Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:PiCANet: Learning Pixel-wise Contextual Attention for Saliency Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators