Object-based (yet Class-agnostic) Video Domain Adaptation

Niu, Dantong; Bar, Amir; Herzig, Roei; Darrell, Trevor; Rohrbach, Anna

Computer Science > Computer Vision and Pattern Recognition

arXiv:2311.17942 (cs)

[Submitted on 29 Nov 2023]

Title:Object-based (yet Class-agnostic) Video Domain Adaptation

Authors:Dantong Niu, Amir Bar, Roei Herzig, Trevor Darrell, Anna Rohrbach

View PDF

Abstract:Existing video-based action recognition systems typically require dense annotation and struggle in environments when there is significant distribution shift relative to the training data. Current methods for video domain adaptation typically fine-tune the model using fully annotated data on a subset of target domain data or align the representation of the two domains using bootstrapping or adversarial learning. Inspired by the pivotal role of objects in recent supervised object-centric action recognition models, we present Object-based (yet Class-agnostic) Video Domain Adaptation (ODAPT), a simple yet effective framework for adapting the existing action recognition systems to new domains by utilizing a sparse set of frames with class-agnostic object annotations in a target domain. Our model achieves a +6.5 increase when adapting across kitchens in Epic-Kitchens and a +3.1 increase adapting between Epic-Kitchens and the EGTEA dataset. ODAPT is a general framework that can also be combined with previous unsupervised methods, offering a +5.0 boost when combined with the self-supervised multi-modal method MMSADA and a +1.7 boost when added to the adversarial-based method TA$^3$N on Epic-Kitchens.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2311.17942 [cs.CV]
	(or arXiv:2311.17942v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2311.17942

Submission history

From: Dantong Niu [view email]
[v1] Wed, 29 Nov 2023 01:17:38 UTC (14,744 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Object-based (yet Class-agnostic) Video Domain Adaptation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Object-based (yet Class-agnostic) Video Domain Adaptation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators