Feature Lenses: Plug-and-play Neural Modules for Transformation-Invariant Visual Representations

Li, Shaohua; Sui, Xiuchao; Fu, Jie; Liu, Yong; Goh, Rick Siow Mong

arXiv:2004.05554 (cs)

[Submitted on 12 Apr 2020]

Abstract: Convolutional Neural Networks (CNNs) are known to be brittle under various image transformations, including rotations, scalings, and changes of lighting conditions. We observe that the features of a transformed image are drastically different from those of the original image. To make CNNs more invariant to transformations, we propose "Feature Lenses", a set of ad-hoc modules that can be easily plugged into a trained model (referred to as the "host model"). Each individual lens reconstructs the original features given the features of a transformed image under a particular transformation. These lenses jointly counteract feature distortions caused by various transformations, thus making the host model more robust without retraining. Because only the lenses are updated, the host model does not need iterative updating when it faces new transformations absent from the training data; as feature semantics are preserved, downstream applications, such as classifiers and detectors, automatically gain robustness without retraining. Lenses are trained in a self-supervised fashion with no annotations, by minimizing a novel "Top-K Activation Contrast Loss" between lens-transformed features and original features. Evaluated on ImageNet, MNIST-rot, and CIFAR-10, Feature Lenses show clear advantages over baseline methods.
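
The plug-in idea can be illustrated with a toy sketch: a lens maps the features of a transformed image back toward the features of the original image, while the host model stays untouched. The abstract does not define the "Top-K Activation Contrast Loss", so the loss below is only an assumed stand-in (mean squared error restricted to the k strongest original activations), not the authors' formulation. The names `top_k_indices`, `top_k_loss`, and `make_affine_lens`, the per-element affine lens, and the feature values are all hypothetical.

```python
from typing import Callable, List, Sequence

Features = List[float]


def top_k_indices(features: Sequence[float], k: int) -> List[int]:
    """Return the indices of the k largest activations, largest first."""
    if not 0 < k <= len(features):
        raise ValueError("k must be between 1 and the number of features")
    order = sorted(range(len(features)), key=lambda i: features[i], reverse=True)
    return order[:k]


def top_k_loss(restored: Sequence[float], original: Sequence[float], k: int) -> float:
    """Assumed stand-in loss, NOT the paper's definition.

    Mean squared error between lens output and original features,
    evaluated only at the k strongest activations of the original.
    """
    idx = top_k_indices(original, k)
    return sum((restored[i] - original[i]) ** 2 for i in idx) / len(idx)


def make_affine_lens(
    scale: Sequence[float], bias: Sequence[float]
) -> Callable[[Sequence[float]], Features]:
    """Hypothetical lens: a per-element affine map applied to host features."""
    def lens(features: Sequence[float]) -> Features:
        return [s * f + b for s, f, b in zip(scale, features, bias)]
    return lens


# Made-up features from a frozen host model for an image and its transformed copy.
original = [0.1, 2.0, 0.0, 1.5]
distorted = [0.4, 1.0, 0.3, 0.5]

# An identity lens leaves the distortion in place; a corrective lens undoes it
# at the strongest activations. Only lens parameters would be trained.
identity_lens = make_affine_lens([1.0] * 4, [0.0] * 4)
corrective_lens = make_affine_lens([1.0] * 4, [0.0, 1.0, 0.0, 1.0])

loss_without_lens = top_k_loss(identity_lens(distorted), original, k=2)
loss_with_lens = top_k_loss(corrective_lens(distorted), original, k=2)
print(loss_without_lens, loss_with_lens)
```

In this sketch the supervision signal is the original image's own features, which is why no labels are needed; a real lens would be a small trainable network inserted after a layer of the host model rather than a fixed affine map.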

Comments:	20 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
Cite as:	arXiv:2004.05554 [cs.CV]
	(or arXiv:2004.05554v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2004.05554

Submission history

From: Shaohua Li
[v1] Sun, 12 Apr 2020 06:36:15 UTC (725 KB)
