Feature Fusion from Head to Tail for Long-Tailed Visual Recognition

Li, Mengke; Hu, Zhikai; Lu, Yang; Lan, Weichao; Cheung, Yiu-ming; Huang, Hui

arXiv:2306.06963 (cs)

[Submitted on 12 Jun 2023 (v1), last revised 18 Dec 2023 (this version, v3)]

Abstract: The imbalanced distribution of long-tailed data presents a considerable challenge for deep learning models, as it causes them to prioritize the accurate classification of head classes but largely disregard tail classes. The biased decision boundary caused by inadequate semantic information in tail classes is one of the key factors contributing to their low recognition accuracy. To rectify this issue, we propose to augment tail classes by grafting the diverse semantic information from head classes, referred to as head-to-tail fusion (H2T). We replace a portion of feature maps from tail classes with those belonging to head classes. These fused features substantially enhance the diversity of tail classes. Both theoretical analysis and practical experimentation demonstrate that H2T can contribute to a more optimized solution for the decision boundary. We seamlessly integrate H2T in the classifier adjustment stage, making it a plug-and-play module. Its simplicity and ease of implementation allow for smooth integration with existing long-tailed recognition methods, facilitating a further performance boost. Extensive experiments on various long-tailed benchmarks demonstrate the effectiveness of the proposed H2T. The source code is available at this https URL.
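
The fusion step described in the abstract (replacing a portion of a tail-class sample's feature maps with those of a head-class sample) can be illustrated with a minimal sketch. This is not the authors' released code. The function name `fuse_head_to_tail`, the parameter `ratio`, the random choice of which channels to swap, and the plain nested-list representation of feature maps are all assumptions made for illustration; the abstract only states that a portion of the feature maps is replaced.

```python
import random
from typing import List, Optional

# Assumption: one feature map (channel) is an H x W grid of floats.
FeatureMap = List[List[float]]


def fuse_head_to_tail(
    tail_maps: List[FeatureMap],
    head_maps: List[FeatureMap],
    ratio: float,
    rng: Optional[random.Random] = None,
) -> List[FeatureMap]:
    """Return a new channel list in which a fraction `ratio` of the
    tail sample's channels is replaced by the head sample's channels.

    Hypothetical helper: which channels are swapped (here, a random
    subset) is an illustrative choice, not taken from the abstract.
    The channel objects themselves are shared with the inputs, not
    deep-copied.
    """
    if len(tail_maps) != len(head_maps):
        raise ValueError("tail and head must have the same number of channels")
    if not 0.0 <= ratio <= 1.0:
        raise ValueError("ratio must lie in [0, 1]")

    rng = rng or random.Random()
    num_channels = len(tail_maps)
    num_swapped = int(round(ratio * num_channels))

    # Pick the channel indices whose maps come from the head sample.
    swapped = set(rng.sample(range(num_channels), num_swapped))

    return [
        head_maps[c] if c in swapped else tail_maps[c]
        for c in range(num_channels)
    ]


if __name__ == "__main__":
    # Toy 8-channel, 1 x 1 feature maps: tail is all zeros, head all ones.
    tail = [[[0.0]] for _ in range(8)]
    head = [[[1.0]] for _ in range(8)]
    fused = fuse_head_to_tail(tail, head, ratio=0.25, rng=random.Random(0))
    # The fused sample would keep the tail sample's label during training.
    print(sum(ch[0][0] for ch in fused))
```

In this sketch the fused features keep the tail sample's label, so the classifier sees tail-class samples enriched with head-class feature maps; how the pairs of samples are drawn, and at which stage of training the fusion is applied, are outside what this example covers.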

Comments:	Accepted to AAAI24; similar to the conference version, with the supplementary material added
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2306.06963 [cs.CV]
	(or arXiv:2306.06963v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.06963

Submission history

From: Mengke Li
[v1] Mon, 12 Jun 2023 08:50:46 UTC (5,631 KB)
[v2] Thu, 14 Dec 2023 06:00:11 UTC (6,406 KB)
[v3] Mon, 18 Dec 2023 14:39:46 UTC (6,406 KB)
