CLDA-YOLO: Visual Contrastive Learning Based Domain Adaptive YOLO Detector

Qiu, Tianheng; Law, Ka Lung; Pan, Guanghua; Wang, Jufei; Gao, Xin; Huang, Xuan; Wei, Hu

arXiv:2412.11812 (cs)

[Submitted on 16 Dec 2024]

Abstract: Unsupervised domain adaptive (UDA) algorithms can markedly improve the performance of object detectors under domain shift, reducing the need for extensive labeling and retraining. Current domain adaptive object detection algorithms primarily target two-stage detectors and tend to offer minimal improvement when applied directly to single-stage detectors such as YOLO. To bring the benefits of UDA to the YOLO detector, we build a comprehensive domain adaptive architecture based on a teacher-student cooperative system. Within this system, we propose uncertainty learning to cope with the highly uncertain pseudo-labels generated by the teacher model, and we leverage dynamic data augmentation to gradually adapt the teacher-student system to the environment. To address the inability of single-stage object detectors to align at multiple stages, we adopt a unified visual contrastive learning paradigm that aligns instances at the backbone and at the head respectively, which steadily improves the robustness of the detector in cross-domain tasks. In summary, we present CLDA-YOLO, an unsupervised domain adaptive YOLO detector based on visual contrastive learning, which achieves highly competitive results across multiple domain adaptive datasets without any reduction in inference speed.
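The abstract does not spell out the contrastive objective, so the following is only a generic illustration of what contrastive alignment of instance features usually involves, not the authors' loss. It sketches a standard InfoNCE-style loss for a single anchor feature in plain Python. The function name `info_nce`, the cosine similarity measure, the `temperature` value, and the toy vectors are all assumptions made for this example.

```python
import math


def info_nce(anchor, positive, negatives, temperature=0.1):
    """Generic InfoNCE loss for one anchor (illustrative, not the paper's loss).

    Returns -log( exp(s_pos / t) / sum_k exp(s_k / t) ), where s is the cosine
    similarity between the anchor and each candidate. All vectors are assumed
    to be non-zero and of equal length.
    """
    def cosine(u, v):
        dot = sum(a * b for a, b in zip(u, v))
        norm_u = math.sqrt(sum(a * a for a in u))
        norm_v = math.sqrt(sum(b * b for b in v))
        return dot / (norm_u * norm_v)

    # The positive pair is placed first, followed by all negatives.
    logits = [cosine(anchor, positive) / temperature]
    logits += [cosine(anchor, neg) / temperature for neg in negatives]

    # Numerically stable log-sum-exp over all candidates.
    peak = max(logits)
    log_sum = peak + math.log(sum(math.exp(x - peak) for x in logits))
    return log_sum - logits[0]


# Toy usage with hypothetical 2-D instance features: an anchor whose positive
# points the same way yields a much smaller loss than one whose positive is
# orthogonal to it.
aligned_loss = info_nce([1.0, 0.0], [1.0, 0.0], [[0.0, 1.0], [-1.0, 0.0]])
misaligned_loss = info_nce([1.0, 0.0], [0.0, 1.0], [[1.0, 0.0]])
```

In a cross-domain setting, one plausible reading is that the anchor and positive would be features of the same instance under different views or domains, with other instances acting as negatives; how CLDA-YOLO actually forms these pairs at the backbone and head is described in the paper itself, not here.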

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2412.11812 [cs.CV]
	(or arXiv:2412.11812v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2412.11812

Submission history

From: Tianheng Qiu
[v1] Mon, 16 Dec 2024 14:25:52 UTC (18,838 KB)
