Why do deep convolutional networks generalize so poorly to small image transformations?

Azulay, Aharon; Weiss, Yair

arXiv:1805.12177 (cs)

[Submitted on 30 May 2018 (v1), last revised 31 Dec 2019 (this version, v4)]

Title: Why do deep convolutional networks generalize so poorly to small image transformations?

Authors: Aharon Azulay, Yair Weiss

Abstract: Convolutional Neural Networks (CNNs) are commonly assumed to be invariant to small image transformations, either because of the convolutional architecture or because they were trained using data augmentation. Recently, several authors have shown that this is not the case: small translations or rescalings of the input image can drastically change the network's prediction. In this paper, we quantify this phenomenon and ask why neither the convolutional architecture nor data augmentation is sufficient to achieve the desired invariance. Specifically, we show that the convolutional architecture does not give invariance because architectures ignore the classical sampling theorem, and that data augmentation does not give invariance because the CNNs learn to be invariant to transformations only for images that are very similar to typical images from the training set. We discuss two possible solutions to this problem, (1) antialiasing the intermediate representations and (2) increasing data augmentation, and show that they provide only a partial solution at best. Taken together, our results indicate that the problem of ensuring invariance to small image transformations in neural networks while preserving high accuracy remains unsolved.
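
The sampling-theorem argument in the abstract can be illustrated with a one-dimensional toy example. The sketch below is not taken from the paper: the helper names `subsample` and `blur`, the alternating test signal, the circular boundary handling, and the 3-tap binomial filter are all assumptions chosen for illustration. It compares a globally averaged, stride-2 subsampled signal before and after a one-sample shift, with and without a low-pass filter applied first.

```python
import numpy as np

def subsample(x, stride=2):
    """Keep every `stride`-th sample, as a strided layer would (hypothetical helper)."""
    return x[::stride]

def blur(x):
    """Circular 3-tap binomial low-pass filter [1, 2, 1] / 4 (assumed antialiasing filter)."""
    return (np.roll(x, 1) + 2.0 * x + np.roll(x, -1)) / 4.0

# Highest-frequency signal the grid can hold: alternating 0 and 1.
x = np.tile([0.0, 1.0], 8)
x_shifted = np.roll(x, 1)  # circular translation by one sample

# Global average after stride-2 subsampling, without any filtering.
naive = subsample(x).mean()
naive_shifted = subsample(x_shifted).mean()

# Same pipeline with the low-pass filter applied before subsampling.
aa = subsample(blur(x)).mean()
aa_shifted = subsample(blur(x_shifted)).mean()

print("no filter:  ", naive, naive_shifted)
print("with filter:", aa, aa_shifted)
```

Without the filter, the one-sample shift changes the pooled value from 0 to 1; with the filter, both values agree. This signal is a best case for the filter, since the binomial kernel removes the alternating component exactly. It should not be read as evidence that antialiasing solves the problem in real networks, where the abstract reports only a partial improvement.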

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1805.12177 [cs.CV]
	(or arXiv:1805.12177v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1805.12177
Journal reference:	JMLR 20(184) 1-25 2019

Submission history

From: Aharon Azulay
[v1] Wed, 30 May 2018 18:56:33 UTC (7,524 KB)
[v2] Mon, 18 Feb 2019 11:33:27 UTC (7,939 KB)
[v3] Tue, 6 Aug 2019 07:52:53 UTC (6,634 KB)
[v4] Tue, 31 Dec 2019 13:40:12 UTC (4,461 KB)
