Siamese Learning Visual Tracking: A Survey

Pflugfelder, Roman

arXiv:1707.00569v1 (cs)

[Submitted on 3 Jul 2017 (this version), latest version 2 Aug 2018 (v2)]

Abstract: This survey reviews the machine learning and stochastic techniques that existing work applies to the challenging problem of visual tracking. It does not attempt to cover the whole tracking literature of the last decades, which seems impossible given the incredibly vast number of published papers. This first draft of the article focuses specifically on recent literature that proposes Siamese networks for learning to track. This approach promises a step forward in terms of robustness, accuracy and computational efficiency. For example, the representative tracker SINT currently performs best on the popular OTB-2013 benchmark, with AuC/IoU/prec. of 65.5/62.5/84.8 % for the one-pass evaluation (OPE). The CVPR'17 work CFNet by the Oxford group shows the approach's large potential for HW/SW co-design, with network memory needs of around 600 kB and frame rates of 75 fps and beyond. Before describing this approach in detail, the article recaps the definition of tracking, the current state-of-the-art view on designing algorithms, and the state of the art of trackers, summarising insights from the existing literature. In future, the article will be extended with a review of two alternative approaches: one using very general recurrent networks such as Long Short-Term Memory (LSTM) networks, and the other, most obvious approach of applying convolutional neural networks (CNNs) alone, the earliest approach since the idea of deep learning tracking appeared at NIPS'13.

Comments:	I would be very grateful to everyone who supports me with comments and suggestions to substantially improve this working paper
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1707.00569 [cs.CV]
	(or arXiv:1707.00569v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1707.00569

Submission history

From: Roman Pflugfelder
[v1] Mon, 3 Jul 2017 14:27:10 UTC (607 KB)
[v2] Thu, 2 Aug 2018 07:15:14 UTC (758 KB)
