The Devil is in the Channels: Mutual-Channel Loss for Fine-Grained Image Classification

Chang, Dongliang; Ding, Yifeng; Xie, Jiyang; Bhunia, Ayan Kumar; Li, Xiaoxu; Ma, Zhanyu; Wu, Ming; Guo, Jun; Song, Yi-Zhe

doi:10.1109/TIP.2020.2973812

Computer Science > Computer Vision and Pattern Recognition

arXiv:2002.04264 (cs)

[Submitted on 11 Feb 2020 (v1), last revised 10 Aug 2021 (this version, v3)]

Title:The Devil is in the Channels: Mutual-Channel Loss for Fine-Grained Image Classification

Authors:Dongliang Chang, Yifeng Ding, Jiyang Xie, Ayan Kumar Bhunia, Xiaoxu Li, Zhanyu Ma, Ming Wu, Jun Guo, Yi-Zhe Song

View PDF

Abstract:Key for solving fine-grained image categorization is finding discriminate and local regions that correspond to subtle visual traits. Great strides have been made, with complex networks designed specifically to learn part-level discriminate feature representations. In this paper, we show it is possible to cultivate subtle details without the need for overly complicated network designs or training mechanisms -- a single loss is all it takes. The main trick lies with how we delve into individual feature channels early on, as opposed to the convention of starting from a consolidated feature map. The proposed loss function, termed as mutual-channel loss (MC-Loss), consists of two channel-specific components: a discriminality component and a diversity component. The discriminality component forces all feature channels belonging to the same class to be discriminative, through a novel channel-wise attention mechanism. The diversity component additionally constraints channels so that they become mutually exclusive on spatial-wise. The end result is therefore a set of feature channels that each reflects different locally discriminative regions for a specific class. The MC-Loss can be trained end-to-end, without the need for any bounding-box/part annotations, and yields highly discriminative regions during inference. Experimental results show our MC-Loss when implemented on top of common base networks can achieve state-of-the-art performance on all four fine-grained categorization datasets (CUB-Birds, FGVC-Aircraft, Flowers-102, and Stanford-Cars). Ablative studies further demonstrate the superiority of MC-Loss when compared with other recently proposed general-purpose losses for visual classification, on two different base networks. Code available at this https URL

Comments:	TIP2020. Code available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2002.04264 [cs.CV]
	(or arXiv:2002.04264v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2002.04264
Related DOI:	https://doi.org/10.1109/TIP.2020.2973812

Submission history

From: Dongliang Chang [view email]
[v1] Tue, 11 Feb 2020 09:12:45 UTC (4,654 KB)
[v2] Thu, 23 Apr 2020 12:56:57 UTC (2,549 KB)
[v3] Tue, 10 Aug 2021 04:23:56 UTC (24,478 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:The Devil is in the Channels: Mutual-Channel Loss for Fine-Grained Image Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:The Devil is in the Channels: Mutual-Channel Loss for Fine-Grained Image Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators