Inductive Bias of Deep Convolutional Networks through Pooling Geometry

Cohen, Nadav; Shashua, Amnon

Computer Science > Neural and Evolutionary Computing

arXiv:1605.06743v4 (cs)

[Submitted on 22 May 2016 (v1), last revised 17 Apr 2017 (this version, v4)]

Title:Inductive Bias of Deep Convolutional Networks through Pooling Geometry

Authors:Nadav Cohen, Amnon Shashua

View PDF

Abstract:Our formal understanding of the inductive bias that drives the success of convolutional networks on computer vision tasks is limited. In particular, it is unclear what makes hypotheses spaces born from convolution and pooling operations so suitable for natural images. In this paper we study the ability of convolutional networks to model correlations among regions of their input. We theoretically analyze convolutional arithmetic circuits, and empirically validate our findings on other types of convolutional networks as well. Correlations are formalized through the notion of separation rank, which for a given partition of the input, measures how far a function is from being separable. We show that a polynomially sized deep network supports exponentially high separation ranks for certain input partitions, while being limited to polynomial separation ranks for others. The network's pooling geometry effectively determines which input partitions are favored, thus serves as a means for controlling the inductive bias. Contiguous pooling windows as commonly employed in practice favor interleaved partitions over coarse ones, orienting the inductive bias towards the statistics of natural images. Other pooling schemes lead to different preferences, and this allows tailoring the network to data that departs from the usual domain of natural imagery. In addition to analyzing deep networks, we show that shallow ones support only linear separation ranks, and by this gain insight into the benefit of functions brought forth by depth - they are able to efficiently model strong correlation under favored partitions of the input.

Comments:	Published as a conference paper at ICLR 2017
Subjects:	Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG)
Cite as:	arXiv:1605.06743 [cs.NE]
	(or arXiv:1605.06743v4 [cs.NE] for this version)
	https://doi.org/10.48550/arXiv.1605.06743

Submission history

From: Nadav Cohen [view email]
[v1] Sun, 22 May 2016 06:15:31 UTC (295 KB)
[v2] Fri, 4 Nov 2016 16:06:20 UTC (606 KB)
[v3] Wed, 14 Dec 2016 10:29:18 UTC (606 KB)
[v4] Mon, 17 Apr 2017 18:36:08 UTC (606 KB)

Computer Science > Neural and Evolutionary Computing

Title:Inductive Bias of Deep Convolutional Networks through Pooling Geometry

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Neural and Evolutionary Computing

Title:Inductive Bias of Deep Convolutional Networks through Pooling Geometry

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators