Surrogate Gradients Design

Herranz-Celotti, Luca; Rouat, Jean

arXiv:2202.00282v1 (cs)

[Submitted on 1 Feb 2022 (this version), latest version 5 Jan 2024 (v4)]

Abstract: Surrogate gradient (SG) training makes it possible to quickly transfer the gains made in deep learning to neuromorphic computing and neuromorphic processors, with a consequent reduction in energy consumption. Evidence suggests that training can be robust to the choice of SG shape after an extensive hyper-parameter search. However, random or grid search becomes exponentially less feasible as the number of hyper-parameters grows. Moreover, every point in the search can itself be highly time- and energy-consuming for large networks and large datasets. First, we show that more complex tasks and networks are more sensitive to the choice of SG. Second, we show that low dampening, high sharpness and low tail fatness are preferred. Third, we observe that Glorot Uniform initialization is generally preferred across most SG choices, although the results vary. Finally, we provide a theoretical solution that reduces the need for extensive grid search when looking for SG shapes and initializations that improve accuracy.
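
To make the three shape properties named in the abstract concrete, here is a minimal NumPy sketch. It is an illustration under stated assumptions, not the authors' implementation: the function names (spike, surrogate_grad, glorot_uniform), the parameter names, and the specific power-law form of the SG are hypothetical choices made here. Dampening is assumed to be the peak height of the SG, sharpness a scaling of the input voltage, and the tail exponent a stand-in for tail fatness (a larger exponent gives a thinner tail). The Glorot Uniform bound is the standard sqrt(6 / (fan_in + fan_out)).

```python
import numpy as np


def spike(v):
    """Forward pass: Heaviside step on the centered membrane voltage v."""
    return (v > 0).astype(np.float64)


def surrogate_grad(v, dampening=1.0, sharpness=1.0, tail=2.0):
    """Stand-in for the derivative of the step function in the backward pass.

    Assumed (hypothetical) parametrization:
      dampening: height of the SG at v = 0
      sharpness: multiplies |v|, so larger values give a narrower SG
      tail:      decay exponent, so larger values give a less fat tail
    """
    return dampening / (1.0 + sharpness * np.abs(v)) ** tail


def glorot_uniform(fan_in, fan_out, rng):
    """Standard Glorot Uniform initialization of a (fan_in, fan_out) matrix."""
    limit = np.sqrt(6.0 / (fan_in + fan_out))
    return rng.uniform(-limit, limit, size=(fan_in, fan_out))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    weights = glorot_uniform(4, 8, rng)
    v = np.linspace(-3.0, 3.0, 7)
    # Forward uses the hard step; backward would use the smooth surrogate.
    print(spike(v))
    print(surrogate_grad(v, dampening=0.3, sharpness=2.0, tail=4.0))
    print(weights.shape)
```

Under these assumptions, the "low dampening, high sharpness, low tail fatness" preference reported in the abstract would correspond to a small dampening value, a large sharpness value, and a large tail exponent.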

Subjects:	Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2202.00282 [cs.NE]
	(or arXiv:2202.00282v1 [cs.NE] for this version)
	https://doi.org/10.48550/arXiv.2202.00282

Submission history

From: Luca Celotti
[v1] Tue, 1 Feb 2022 09:10:57 UTC (434 KB)
[v2] Wed, 2 Feb 2022 12:10:47 UTC (442 KB)
[v3] Fri, 3 Nov 2023 12:04:38 UTC (672 KB)
[v4] Fri, 5 Jan 2024 00:28:16 UTC (677 KB)
