The Tunnel Effect: Building Data Representations in Deep Neural Networks

Masarczyk, Wojciech; Ostaszewski, Mateusz; Imani, Ehsan; Pascanu, Razvan; Miłoś, Piotr; Trzciński, Tomasz

Computer Science > Machine Learning

arXiv:2305.19753 (cs)

[Submitted on 31 May 2023 (v1), last revised 30 Oct 2023 (this version, v2)]

Title:The Tunnel Effect: Building Data Representations in Deep Neural Networks

Authors:Wojciech Masarczyk, Mateusz Ostaszewski, Ehsan Imani, Razvan Pascanu, Piotr Miłoś, Tomasz Trzciński

View PDF

Abstract:Deep neural networks are widely known for their remarkable effectiveness across various tasks, with the consensus that deeper networks implicitly learn more complex data representations. This paper shows that sufficiently deep networks trained for supervised image classification split into two distinct parts that contribute to the resulting data representations differently. The initial layers create linearly-separable representations, while the subsequent layers, which we refer to as \textit{the tunnel}, compress these representations and have a minimal impact on the overall performance. We explore the tunnel's behavior through comprehensive empirical studies, highlighting that it emerges early in the training process. Its depth depends on the relation between the network's capacity and task complexity. Furthermore, we show that the tunnel degrades out-of-distribution generalization and discuss its implications for continual learning.

Comments:	NeurIPS 2023
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2305.19753 [cs.LG]
	(or arXiv:2305.19753v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2305.19753

Submission history

From: Wojciech Masarczyk [view email]
[v1] Wed, 31 May 2023 11:38:24 UTC (891 KB)
[v2] Mon, 30 Oct 2023 12:41:27 UTC (1,208 KB)

Computer Science > Machine Learning

Title:The Tunnel Effect: Building Data Representations in Deep Neural Networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:The Tunnel Effect: Building Data Representations in Deep Neural Networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators