Learning Domain-Aware Detection Head with Prompt Tuning

Li, Haochen; Zhang, Rui; Yao, Hantao; Song, Xinkai; Hao, Yifan; Zhao, Yongwei; Li, Ling; Chen, Yunji

arXiv:2306.05718 (cs)

[Submitted on 9 Jun 2023 (v1), last revised 10 Oct 2023 (this version, v3)]

Abstract: Domain adaptive object detection (DAOD) aims to generalize detectors trained on an annotated source domain to an unlabelled target domain. However, existing methods focus on reducing the domain bias of the detection backbone by inferring a discriminative visual encoder, while ignoring the domain bias in the detection head. Inspired by the strong generalization of vision-language models (VLMs), using a VLM as a robust detection backbone followed by a domain-aware detection head is a reasonable way to learn a discriminative detector for each domain, rather than reducing the domain bias as traditional methods do. To this end, we propose a novel DAOD framework named Domain-Aware detection head with Prompt tuning (DA-Pro), which applies a learnable domain-adaptive prompt to generate a dynamic detection head for each domain. Formally, the domain-adaptive prompt consists of domain-invariant tokens, domain-specific tokens, and a domain-related textual description along with the class label. Furthermore, two constraints between the source and target domains are applied to ensure that the domain-adaptive prompt captures both domain-shared and domain-specific knowledge. A prompt ensemble strategy is also proposed to reduce the effect of prompt disturbance. Comprehensive experiments over multiple cross-domain adaptation tasks demonstrate that the domain-adaptive prompt produces an effective domain-related detection head that boosts domain-adaptive object detection. Our code is available at this https URL.
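The prompt layout described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the embedding width, the mean-pooling "text encoder", the cosine-similarity classifier, the token counts, and the example description "foggy weather" are all assumptions chosen only to show how domain-invariant tokens, domain-specific tokens, a textual domain description, and a class label might be concatenated into one prompt per class, yielding a different head for each domain.

```python
import math
import random

EMBED_DIM = 8  # hypothetical embedding width, not taken from the paper


def random_vector(rng, dim=EMBED_DIM):
    """Stand-in for a learnable prompt token (random initial values)."""
    return [rng.uniform(-1.0, 1.0) for _ in range(dim)]


def word_vector(word, dim=EMBED_DIM):
    """Deterministic stand-in for a frozen word embedding (not a real VLM tokenizer)."""
    return random_vector(random.Random(word), dim)


def build_prompt(invariant_tokens, specific_tokens, domain_description, class_name):
    """Concatenate the four parts named in the abstract into one token sequence."""
    description_tokens = [word_vector(w) for w in domain_description.split()]
    return invariant_tokens + specific_tokens + description_tokens + [word_vector(class_name)]


def encode_prompt(prompt):
    """Assumed text encoder: mean-pool the token vectors (a real VLM uses a transformer)."""
    dim = len(prompt[0])
    return [sum(token[i] for token in prompt) / len(prompt) for i in range(dim)]


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def build_detection_head(invariant_tokens, specific_tokens, domain_description, class_names):
    """One encoded prompt per class acts as that class's classifier weight."""
    return {
        name: encode_prompt(
            build_prompt(invariant_tokens, specific_tokens, domain_description, name)
        )
        for name in class_names
    }


def classify(region_feature, head):
    """Score a region feature against every class weight by cosine similarity."""
    scores = {name: cosine(region_feature, weight) for name, weight in head.items()}
    return max(scores, key=scores.get), scores


# Hypothetical setup: 4 shared tokens and 2 tokens per domain.
rng = random.Random(0)
invariant = [random_vector(rng) for _ in range(4)]
specific = {
    "source": [random_vector(rng) for _ in range(2)],
    "target": [random_vector(rng) for _ in range(2)],
}
classes = ["car", "person"]

# The invariant tokens are shared; the specific tokens and description differ per domain.
source_head = build_detection_head(invariant, specific["source"], "clear weather", classes)
target_head = build_detection_head(invariant, specific["target"], "foggy weather", classes)
```

In this sketch only `invariant` and `specific` would be trained, while `word_vector` and `encode_prompt` stand in for frozen VLM components. The two cross-domain constraints and the prompt ensemble mentioned in the abstract are not modelled here, since the abstract does not specify them.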

Comments:	Accepted by NeurIPS 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2306.05718 [cs.CV]
	(or arXiv:2306.05718v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.05718

Submission history

From: Haochen Li [view email]
[v1] Fri, 9 Jun 2023 07:30:10 UTC (5,359 KB)
[v2] Sun, 8 Oct 2023 04:30:12 UTC (5,446 KB)
[v3] Tue, 10 Oct 2023 03:49:32 UTC (5,446 KB)
