Revisiting Mid-Level Patterns for Cross-Domain Few-Shot Recognition

Zou, Yixiong; Zhang, Shanghang; Yu, JianPeng; Tian, Yonghong; Moura, José M. F.

Computer Science > Computer Vision and Pattern Recognition

arXiv:2008.03128 (cs)

[Submitted on 7 Aug 2020 (v1), last revised 1 Nov 2021 (this version, v4)]

Title:Revisiting Mid-Level Patterns for Cross-Domain Few-Shot Recognition

Authors:Yixiong Zou, Shanghang Zhang, JianPeng Yu, Yonghong Tian, José M. F. Moura

View PDF

Abstract:Existing few-shot learning (FSL) methods usually assume base classes and novel classes are from the same domain (in-domain setting). However, in practice, it may be infeasible to collect sufficient training samples for some special domains to construct base classes. To solve this problem, cross-domain FSL (CDFSL) is proposed very recently to transfer knowledge from general-domain base classes to special-domain novel classes. Existing CDFSL works mostly focus on transferring between near domains, while rarely consider transferring between distant domains, which is in practical need as any novel classes could appear in real-world applications, and is even more challenging. In this paper, we study a challenging subset of CDFSL where the novel classes are in distant domains from base classes, by revisiting the mid-level features, which are more transferable yet under-explored in main stream FSL work. To boost the discriminability of mid-level features, we propose a residual-prediction task to encourage mid-level features to learn discriminative information of each sample. Notably, such mechanism also benefits the in-domain FSL and CDFSL in near domains. Therefore, we provide two types of features for both cross- and in-domain FSL respectively, under the same training framework. Experiments under both settings on six public datasets, including two challenging medical datasets, validate the our rationale and demonstrate state-of-the-art performance. Code will be released.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2008.03128 [cs.CV]
	(or arXiv:2008.03128v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2008.03128

Submission history

From: Yixiong Zou [view email]
[v1] Fri, 7 Aug 2020 12:45:39 UTC (7,760 KB)
[v2] Mon, 21 Sep 2020 23:54:01 UTC (2,059 KB)
[v3] Tue, 20 Apr 2021 09:03:52 UTC (10,454 KB)
[v4] Mon, 1 Nov 2021 03:18:25 UTC (10,455 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Revisiting Mid-Level Patterns for Cross-Domain Few-Shot Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Revisiting Mid-Level Patterns for Cross-Domain Few-Shot Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators