Dyn-Adapter: Towards Disentangled Representation for Efficient Visual Recognition

Zhang, Yurong; Chen, Honghao; Zhang, Xinyu; Chu, Xiangxiang; Song, Li

Computer Science > Computer Vision and Pattern Recognition

arXiv:2407.14302 (cs)

[Submitted on 19 Jul 2024 (v1), last revised 23 Jul 2024 (this version, v2)]

Title:Dyn-Adapter: Towards Disentangled Representation for Efficient Visual Recognition

Authors:Yurong Zhang, Honghao Chen, Xinyu Zhang, Xiangxiang Chu, Li Song

View PDF HTML (experimental)

Abstract:Parameter-efficient transfer learning (PETL) is a promising task, aiming to adapt the large-scale pre-trained model to downstream tasks with a relatively modest cost. However, current PETL methods struggle in compressing computational complexity and bear a heavy inference burden due to the complete forward process. This paper presents an efficient visual recognition paradigm, called Dynamic Adapter (Dyn-Adapter), that boosts PETL efficiency by subtly disentangling features in multiple levels. Our approach is simple: first, we devise a dynamic architecture with balanced early heads for multi-level feature extraction, along with adaptive training strategy. Second, we introduce a bidirectional sparsity strategy driven by the pursuit of powerful generalization ability. These qualities enable us to fine-tune efficiently and effectively: we reduce FLOPs during inference by 50%, while maintaining or even yielding higher recognition accuracy. Extensive experiments on diverse datasets and pretrained backbones demonstrate the potential of Dyn-Adapter serving as a general efficiency booster for PETL in vision recognition tasks.

Comments:	ECCV 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.14302 [cs.CV]
	(or arXiv:2407.14302v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.14302

Submission history

From: Yurong Zhang [view email]
[v1] Fri, 19 Jul 2024 13:33:38 UTC (815 KB)
[v2] Tue, 23 Jul 2024 07:57:17 UTC (815 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Dyn-Adapter: Towards Disentangled Representation for Efficient Visual Recognition

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Dyn-Adapter: Towards Disentangled Representation for Efficient Visual Recognition

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators