Self-supervised Multi-view Person Association and Its Applications

Vo, Minh; Yumer, Ersin; Sunkavalli, Kalyan; Hadap, Sunil; Sheikh, Yaser; Narasimhan, Srinivasa

doi:10.1109/TPAMI.2020.2974726

arXiv:1805.08717 (cs)

[Submitted on 22 May 2018 (v1), last revised 18 Apr 2020 (this version, v3)]

Abstract: Reliable markerless motion tracking of people participating in a complex group activity from multiple moving cameras is challenging due to frequent occlusions, strong viewpoint and appearance variations, and asynchronous video streams. To solve this problem, reliable association of the same person across distant viewpoints and temporal instances is essential. We present a self-supervised framework to adapt a generic person appearance descriptor to the unlabeled videos by exploiting motion tracking, mutual exclusion constraints, and multi-view geometry. The adapted discriminative descriptor is used in a tracking-by-clustering formulation. We validate the effectiveness of our descriptor learning on WILDTRACK [14] and three new complex social scenes captured by multiple cameras with up to 60 people "in the wild". We report significant improvements over the baseline in association accuracy (up to 18%) and in the stability and coherence of 3D human skeleton tracking (5 to 10 times). Using the reconstructed 3D skeletons, we cut the input videos into a multi-angle video in which a specified person is shown from the best visible front-facing camera. Our algorithm detects inter-human occlusion to determine the camera-switching moments while maintaining the flow of the action.

Comments:	Accepted to IEEE TPAMI
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1805.08717 [cs.CV]
	(or arXiv:1805.08717v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1805.08717
Related DOI:	https://doi.org/10.1109/TPAMI.2020.2974726

Submission history

From: Minh Vo
[v1] Tue, 22 May 2018 16:25:26 UTC (8,761 KB)
[v2] Thu, 15 Nov 2018 21:39:20 UTC (1 KB) (withdrawn)
[v3] Sat, 18 Apr 2020 06:16:40 UTC (8,896 KB)
