Interplay between depth and width for interpolation in neural ODEs

Álvarez-López, Antonio; Slimane, Arselane Hadj; Zuazua, Enrique

arXiv:2401.09902 (math)

[Submitted on 18 Jan 2024 (v1), last revised 6 Feb 2024 (this version, v3)]

Abstract: Neural ordinary differential equations (neural ODEs) have emerged as a natural tool for supervised learning from a control perspective, yet a complete understanding of their optimal architecture remains elusive. In this work, we examine the interplay between their width $p$ and number of layer transitions $L$ (effectively the depth $L+1$). Specifically, we assess the model expressivity in terms of its capacity to interpolate either a finite dataset $D$ comprising $N$ pairs of points or two probability measures in $\mathbb{R}^d$ within a Wasserstein error margin $\varepsilon>0$. Our findings reveal a balancing trade-off between $p$ and $L$, with $L$ scaling as $O(1+N/p)$ for dataset interpolation, and $L=O\left(1+(p\varepsilon^d)^{-1}\right)$ for measure interpolation.
In the autonomous case, where $L=0$, a separate study is required, which we undertake focusing on dataset interpolation. We address the relaxed problem of $\varepsilon$-approximate controllability and establish an error decay of $\varepsilon\sim O(\log(p)p^{-1/d})$. This decay rate is a consequence of applying a universal approximation theorem to a custom-built Lipschitz vector field that interpolates $D$. In the high-dimensional setting, we further demonstrate that $p=O(N)$ neurons are likely sufficient to achieve exact control.
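
To make the roles of the width $p$ and the number of layer transitions $L$ concrete, the following is a minimal sketch, not the authors' implementation. It assumes a commonly used neural ODE form, $\dot{x}(t) = W(t)\,\sigma(A(t)x(t) + b(t))$ with ReLU activation $\sigma$ and parameters that are piecewise constant on $L+1$ equal time intervals; the abstract does not state the model equation, so this form, the function name `neural_ode_flow`, the forward Euler scheme, and all argument names are assumptions made for illustration.

```python
import numpy as np


def relu(z):
    """Elementwise ReLU activation (an assumed choice of sigma)."""
    return np.maximum(z, 0.0)


def neural_ode_flow(x0, W, A, b, T=1.0, steps_per_layer=100):
    """Integrate x'(t) = W_k relu(A_k x(t) + b_k) with forward Euler.

    Hypothetical helper for illustration only.

    x0 : (n, d) array of n input points in R^d
    W  : (L+1, d, p) outer weights, one slice per time interval
    A  : (L+1, p, d) inner weights, one slice per time interval
    b  : (L+1, p)    biases, one slice per time interval

    The parameters are held constant on each of the L+1 equal
    sub-intervals of [0, T], so there are L switches ("layer
    transitions") and p neurons (the width).
    """
    n_layers = W.shape[0]                     # equals L + 1
    dt = T / (n_layers * steps_per_layer)     # uniform Euler step
    x = np.array(x0, dtype=float)
    for k in range(n_layers):
        for _ in range(steps_per_layer):
            hidden = relu(x @ A[k].T + b[k])  # shape (n, p)
            x = x + dt * hidden @ W[k].T      # shape (n, d)
    return x


# Example: d = 2, width p = 3, L = 1 transition (two parameter slices).
rng = np.random.default_rng(0)
d, p, L, n = 2, 3, 1, 4
x0 = rng.normal(size=(n, d))
W = rng.normal(size=(L + 1, d, p))
A = rng.normal(size=(L + 1, p, d))
b = rng.normal(size=(L + 1, p))
x_T = neural_ode_flow(x0, W, A, b)
```

In this sketch, the autonomous case $L=0$ corresponds to passing a single parameter slice, so the same vector field acts on the whole interval $[0, T]$. Interpolating a dataset would mean choosing `W`, `A`, `b` so that each input point is driven to its target at time $T$; the sketch only evaluates the flow and does not construct such parameters.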

Comments:	16 pages, 10 figures, double column
Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG)
MSC classes:	34H05, 68T07, 93B05 (Primary) 35Q49 (Secondary)
Cite as:	arXiv:2401.09902 [math.OC]
	(or arXiv:2401.09902v3 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2401.09902

Submission history

From: Antonio Álvarez-López
[v1] Thu, 18 Jan 2024 11:32:50 UTC (4,256 KB)
[v2] Fri, 19 Jan 2024 14:04:22 UTC (4,257 KB)
[v3] Tue, 6 Feb 2024 17:05:48 UTC (4,242 KB)
