RCL: Recurrent Continuous Localization for Temporal Action Detection

Wang, Qiang; Zhang, Yanhao; Zheng, Yun; Pan, Pan

Computer Science > Computer Vision and Pattern Recognition

arXiv:2203.07112 (cs)

[Submitted on 14 Mar 2022]

Title:RCL: Recurrent Continuous Localization for Temporal Action Detection

Authors:Qiang Wang, Yanhao Zhang, Yun Zheng, Pan Pan

View PDF

Abstract:Temporal representation is the cornerstone of modern action detection techniques. State-of-the-art methods mostly rely on a dense anchoring scheme, where anchors are sampled uniformly over the temporal domain with a discretized grid, and then regress the accurate boundaries. In this paper, we revisit this foundational stage and introduce Recurrent Continuous Localization (RCL), which learns a fully continuous anchoring representation. Specifically, the proposed representation builds upon an explicit model conditioned with video embeddings and temporal coordinates, which ensure the capability of detecting segments with arbitrary length. To optimize the continuous representation, we develop an effective scale-invariant sampling strategy and recurrently refine the prediction in subsequent iterations. Our continuous anchoring scheme is fully differentiable, allowing to be seamlessly integrated into existing detectors, e.g., BMN and G-TAD. Extensive experiments on two benchmarks demonstrate that our continuous representation steadily surpasses other discretized counterparts by ~2% mAP. As a result, RCL achieves 52.92% mAP@0.5 on THUMOS14 and 37.65% mAP on ActivtiyNet v1.3, outperforming all existing single-model detectors.

Comments:	9 pages, 7 figures, CVPR2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2203.07112 [cs.CV]
	(or arXiv:2203.07112v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2203.07112

Submission history

From: Qiang Wang [view email]
[v1] Mon, 14 Mar 2022 13:56:12 UTC (3,061 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:RCL: Recurrent Continuous Localization for Temporal Action Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:RCL: Recurrent Continuous Localization for Temporal Action Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators