TENet: Triple Excitation Network for Video Salient Object Detection

Ren, Sucheng; Han, Chu; Yang, Xin; Han, Guoqiang; He, Shengfeng

Computer Science > Computer Vision and Pattern Recognition

arXiv:2007.09943 (cs)

[Submitted on 20 Jul 2020 (v1), last revised 30 Aug 2020 (this version, v2)]

Title:TENet: Triple Excitation Network for Video Salient Object Detection

Authors:Sucheng Ren, Chu Han, Xin Yang, Guoqiang Han, Shengfeng He

View PDF

Abstract:In this paper, we propose a simple yet effective approach, named Triple Excitation Network, to reinforce the training of video salient object detection (VSOD) from three aspects, spatial, temporal, and online excitations. These excitation mechanisms are designed following the spirit of curriculum learning and aim to reduce learning ambiguities at the beginning of training by selectively exciting feature activations using ground truth. Then we gradually reduce the weight of ground truth excitations by a curriculum rate and replace it by a curriculum complementary map for better and faster convergence. In particular, the spatial excitation strengthens feature activations for clear object boundaries, while the temporal excitation imposes motions to emphasize spatio-temporal salient regions. Spatial and temporal excitations can combat the saliency shifting problem and conflict between spatial and temporal features of VSOD. Furthermore, our semi-curriculum learning design enables the first online refinement strategy for VSOD, which allows exciting and boosting saliency responses during testing without re-training. The proposed triple excitations can easily plug in different VSOD methods. Extensive experiments show the effectiveness of all three excitation methods and the proposed method outperforms state-of-the-art image and video salient object detection methods.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2007.09943 [cs.CV]
	(or arXiv:2007.09943v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2007.09943

Submission history

From: Sucheng Ren [view email]
[v1] Mon, 20 Jul 2020 08:45:41 UTC (3,734 KB)
[v2] Sun, 30 Aug 2020 12:59:31 UTC (5,262 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:TENet: Triple Excitation Network for Video Salient Object Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:TENet: Triple Excitation Network for Video Salient Object Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators