Domain Generalization via Balancing Training Difficulty and Model Capability

Jiang, Xueying; Huang, Jiaxing; Jin, Sheng; Lu, Shijian

Computer Science > Computer Vision and Pattern Recognition

arXiv:2309.00844 (cs)

[Submitted on 2 Sep 2023]

Title:Domain Generalization via Balancing Training Difficulty and Model Capability

Authors:Xueying Jiang, Jiaxing Huang, Sheng Jin, Shijian Lu

View PDF

Abstract:Domain generalization (DG) aims to learn domain-generalizable models from one or multiple source domains that can perform well in unseen target domains. Despite its recent progress, most existing work suffers from the misalignment between the difficulty level of training samples and the capability of contemporarily trained models, leading to over-fitting or under-fitting in the trained generalization model. We design MoDify, a Momentum Difficulty framework that tackles the misalignment by balancing the seesaw between the model's capability and the samples' difficulties along the training process. MoDify consists of two novel designs that collaborate to fight against the misalignment while learning domain-generalizable models. The first is MoDify-based Data Augmentation which exploits an RGB Shuffle technique to generate difficulty-aware training samples on the fly. The second is MoDify-based Network Optimization which dynamically schedules the training samples for balanced and smooth learning with appropriate difficulty. Without bells and whistles, a simple implementation of MoDify achieves superior performance across multiple benchmarks. In addition, MoDify can complement existing methods as a plug-in, and it is generic and can work for different visual recognition tasks.

Comments:	11 pages, 6 figures, Accepted by ICCV 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2309.00844 [cs.CV]
	(or arXiv:2309.00844v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2309.00844

Submission history

From: Xueying Jiang [view email]
[v1] Sat, 2 Sep 2023 07:09:23 UTC (926 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Domain Generalization via Balancing Training Difficulty and Model Capability

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Domain Generalization via Balancing Training Difficulty and Model Capability

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators