Learning Disentangled Representation with Mutual Information Maximization for Real-Time UAV Tracking

Wang, Xucheng; Yang, Xiangyang; Ye, Hengzhou; Li, Shuiwang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2308.10262 (cs)

[Submitted on 20 Aug 2023]

Title:Learning Disentangled Representation with Mutual Information Maximization for Real-Time UAV Tracking

Authors:Xucheng Wang, Xiangyang Yang, Hengzhou Ye, Shuiwang Li

View PDF

Abstract:Efficiency has been a critical problem in UAV tracking due to limitations in computation resources, battery capacity, and unmanned aerial vehicle maximum load. Although discriminative correlation filters (DCF)-based trackers prevail in this field for their favorable efficiency, some recently proposed lightweight deep learning (DL)-based trackers using model compression demonstrated quite remarkable CPU efficiency as well as precision. Unfortunately, the model compression methods utilized by these works, though simple, are still unable to achieve satisfying tracking precision with higher compression rates. This paper aims to exploit disentangled representation learning with mutual information maximization (DR-MIM) to further improve DL-based trackers' precision and efficiency for UAV tracking. The proposed disentangled representation separates the feature into an identity-related and an identity-unrelated features. Only the latter is used, which enhances the effectiveness of the feature representation for subsequent classification and regression tasks. Extensive experiments on four UAV benchmarks, including UAV123@10fps, DTB70, UAVDT and VisDrone2018, show that our DR-MIM tracker significantly outperforms state-of-the-art UAV tracking methods.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2308.10262 [cs.CV]
	(or arXiv:2308.10262v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2308.10262

Submission history

From: Xucheng Wang [view email]
[v1] Sun, 20 Aug 2023 13:16:15 UTC (1,391 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Learning Disentangled Representation with Mutual Information Maximization for Real-Time UAV Tracking

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Learning Disentangled Representation with Mutual Information Maximization for Real-Time UAV Tracking

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators