Decoupled Mixup for Generalized Visual Recognition

Liu, Haozhe; Zhang, Wentian; Xie, Jinheng; Wu, Haoqian; Li, Bing; Zhang, Ziqi; Li, Yuexiang; Huang, Yawen; Ghanem, Bernard; Zheng, Yefeng

Computer Science > Computer Vision and Pattern Recognition

arXiv:2210.14783 (cs)

[Submitted on 26 Oct 2022]

Title:Decoupled Mixup for Generalized Visual Recognition

Authors:Haozhe Liu, Wentian Zhang, Jinheng Xie, Haoqian Wu, Bing Li, Ziqi Zhang, Yuexiang Li, Yawen Huang, Bernard Ghanem, Yefeng Zheng

View PDF

Abstract:Convolutional neural networks (CNN) have demonstrated remarkable performance when the training and testing data are from the same distribution. However, such trained CNN models often largely degrade on testing data which is unseen and Out-Of-the-Distribution (OOD). To address this issue, we propose a novel "Decoupled-Mixup" method to train CNN models for OOD visual recognition. Different from previous work combining pairs of images homogeneously, our method decouples each image into discriminative and noise-prone regions, and then heterogeneously combines these regions of image pairs to train CNN models. Since the observation is that noise-prone regions such as textural and clutter backgrounds are adverse to the generalization ability of CNN models during training, we enhance features from discriminative regions and suppress noise-prone ones when combining an image pair. To further improve the generalization ability of trained models, we propose to disentangle discriminative and noise-prone regions in frequency-based and context-based fashions. Experiment results show the high generalization performance of our method on testing data that are composed of unseen contexts, where our method achieves 85.76\% top-1 accuracy in Track-1 and 79.92\% in Track-2 in the NICO Challenge. The source code is available at this https URL.

Comments:	Accepted by ECCV'2022 Workshop: Causality in Vision
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2210.14783 [cs.CV]
	(or arXiv:2210.14783v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2210.14783

Submission history

From: Haozhe Liu [view email]
[v1] Wed, 26 Oct 2022 15:21:39 UTC (1,937 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Decoupled Mixup for Generalized Visual Recognition

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Decoupled Mixup for Generalized Visual Recognition

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators