Neural Architecture Search without Training

Mellor, Joseph; Turner, Jack; Storkey, Amos; Crowley, Elliot J.

Computer Science > Machine Learning

arXiv:2006.04647v2 (cs)

[Submitted on 8 Jun 2020 (v1), revised 26 Feb 2021 (this version, v2), latest version 11 Jun 2021 (v3)]

Title:Neural Architecture Search without Training

Authors:Joseph Mellor, Jack Turner, Amos Storkey, Elliot J. Crowley

View PDF

Abstract:The time and effort involved in hand-designing deep neural networks is immense. This has prompted the development of Neural Architecture Search (NAS) techniques to automate this design. However, NAS algorithms tend to be slow and expensive; they need to train vast numbers of candidate networks to inform the search process. This could be alleviated if we could partially predict a network's trained accuracy from its initial state. In this work, we examine the overlap of activations between datapoints in untrained networks and motivate how this can give a measure which is usefully indicative of a network's trained performance. We incorporate this measure into a simple algorithm that allows us to search for powerful networks without any training in a matter of seconds on a single GPU, and verify its effectiveness on NAS-Bench-101, NAS-Bench-201, and Network Design Spaces. Finally, our approach can be readily combined with more expensive search methods; we examine a simple adaptation of regularised evolutionary search that outperforms its predecessor. Code for reproducing our experiments is available at this https URL.

Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:2006.04647 [cs.LG]
	(or arXiv:2006.04647v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2006.04647

Submission history

From: Elliot J. Crowley [view email]
[v1] Mon, 8 Jun 2020 14:53:56 UTC (419 KB)
[v2] Fri, 26 Feb 2021 11:36:56 UTC (6,234 KB)
[v3] Fri, 11 Jun 2021 14:31:02 UTC (1,457 KB)

Computer Science > Machine Learning

Title:Neural Architecture Search without Training

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Neural Architecture Search without Training

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators