Optimisation & Generalisation in Networks of Neurons

Bernstein, Jeremy

Computer Science > Neural and Evolutionary Computing

arXiv:2210.10101 (cs)

[Submitted on 18 Oct 2022]

Title:Optimisation & Generalisation in Networks of Neurons

Authors:Jeremy Bernstein

View PDF

Abstract:The goal of this thesis is to develop the optimisation and generalisation theoretic foundations of learning in artificial neural networks. On optimisation, a new theoretical framework is proposed for deriving architecture-dependent first-order optimisation algorithms. The approach works by combining a "functional majorisation" of the loss function with "architectural perturbation bounds" that encode an explicit dependence on neural architecture. The framework yields optimisation methods that transfer hyperparameters across learning problems. On generalisation, a new correspondence is proposed between ensembles of networks and individual networks. It is argued that, as network width and normalised margin are taken large, the space of networks that interpolate a particular training set concentrates on an aggregated Bayesian method known as a "Bayes point machine". This correspondence provides a route for transferring PAC-Bayesian generalisation theorems over to individual networks. More broadly, the correspondence presents a fresh perspective on the role of regularisation in networks with vastly more parameters than data.

Comments:	PhD thesis
Subjects:	Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Information Theory (cs.IT); Machine Learning (cs.LG); Numerical Analysis (math.NA)
Cite as:	arXiv:2210.10101 [cs.NE]
	(or arXiv:2210.10101v1 [cs.NE] for this version)
	https://doi.org/10.48550/arXiv.2210.10101

Submission history

From: Jeremy Bernstein [view email]
[v1] Tue, 18 Oct 2022 18:58:40 UTC (3,852 KB)

Computer Science > Neural and Evolutionary Computing

Title:Optimisation & Generalisation in Networks of Neurons

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Neural and Evolutionary Computing

Title:Optimisation & Generalisation in Networks of Neurons

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators