Improving Accuracy and Generalization for Efficient Visual Tracking

Zaveri, Ram; Patel, Shivang; Gu, Yu; Doretto, Gianfranco

arXiv:2411.18855 (cs)

[Submitted on 28 Nov 2024 (v1), last revised 6 Feb 2025 (this version, v2)]

Abstract: Efficient visual trackers overfit to their training distributions and generalize poorly: they perform well on their respective in-distribution (ID) test sets but less well on out-of-distribution (OOD) sequences, which limits their deployment in the wild under constrained resources. We introduce SiamABC, a highly efficient Siamese tracker that significantly improves tracking performance, even on OOD sequences. SiamABC relies on new architectural designs for bridging the dynamic variability of the target and on new training losses. It also addresses OOD tracking generalization directly by including a fast, backward-free dynamic test-time adaptation method that continuously adapts the model to the dynamic visual changes of the target. Our extensive experiments suggest that SiamABC achieves remarkable performance gains on OOD sets while maintaining accurate performance on the ID benchmarks. SiamABC outperforms MixFormerV2-S by 7.6% on the OOD AVisT benchmark while being 3x faster (100 FPS) on a CPU. Our code and models are available at this https URL.
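The abstract does not spell out how the test-time adaptation works, so the following is only a minimal sketch of what "backward-free" adaptation can look like in general, not the SiamABC method itself. It assumes one common family of such techniques: refreshing per-channel normalization statistics with an exponential moving average computed from the current frame's features, using forward-pass quantities only (no loss, no gradients). The class name `RunningNorm`, its fields, and the momentum value are all hypothetical and chosen for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class RunningNorm:
    """Per-channel normalization statistics (hypothetical helper, not from the paper)."""

    mean: List[float]
    var: List[float]
    momentum: float = 0.1  # assumed EMA rate, illustrative value only
    eps: float = 1e-5

    def adapt(self, feats: List[List[float]]) -> None:
        """Blend the stored statistics toward those of the current features.

        `feats` is a list of feature vectors, one per spatial location.
        Only forward-pass values are used: no loss and no backpropagation.
        """
        n = len(feats)
        for c in range(len(self.mean)):
            column = [f[c] for f in feats]
            batch_mean = sum(column) / n
            batch_var = sum((x - batch_mean) ** 2 for x in column) / n
            self.mean[c] = (1 - self.momentum) * self.mean[c] + self.momentum * batch_mean
            self.var[c] = (1 - self.momentum) * self.var[c] + self.momentum * batch_var

    def normalize(self, feats: List[List[float]]) -> List[List[float]]:
        """Normalize features with the current (possibly adapted) statistics."""
        return [
            [(x - m) / (v + self.eps) ** 0.5 for x, m, v in zip(f, self.mean, self.var)]
            for f in feats
        ]


if __name__ == "__main__":
    # Statistics as if learned on training data: zero mean, unit variance.
    norm = RunningNorm(mean=[0.0], var=[1.0])
    # Test-time frames whose features have drifted to a mean of 5.
    shifted_frame = [[4.0], [6.0]]
    for _ in range(50):
        norm.adapt(shifted_frame)
    print(norm.mean[0])  # moves from 0 toward 5 as frames are observed
```

Because each update is a closed-form average over features the tracker already computes, the per-frame cost of this kind of adaptation is small compared with a gradient step, which is the general appeal of backward-free schemes on CPU-bound trackers.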

Comments:	WACV 2025
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
Cite as:	arXiv:2411.18855 [cs.CV]
	(or arXiv:2411.18855v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2411.18855

Submission history

From: Gianfranco Doretto
[v1] Thu, 28 Nov 2024 01:51:46 UTC (1,759 KB)
[v2] Thu, 6 Feb 2025 22:25:39 UTC (1,770 KB)
