Hardware Beyond Backpropagation: a Photonic Co-Processor for Direct Feedback Alignment

Launay, Julien; Poli, Iacopo; Müller, Kilian; Pariente, Gustave; Carron, Igor; Daudet, Laurent; Krzakala, Florent; Gigan, Sylvain

Computer Science > Machine Learning

arXiv:2012.06373 (cs)

[Submitted on 11 Dec 2020]

Title:Hardware Beyond Backpropagation: a Photonic Co-Processor for Direct Feedback Alignment

Authors:Julien Launay, Iacopo Poli, Kilian Müller, Gustave Pariente, Igor Carron, Laurent Daudet, Florent Krzakala, Sylvain Gigan

View PDF

Abstract:The scaling hypothesis motivates the expansion of models past trillions of parameters as a path towards better performance. Recent significant developments, such as GPT-3, have been driven by this conjecture. However, as models scale-up, training them efficiently with backpropagation becomes difficult. Because model, pipeline, and data parallelism distribute parameters and gradients over compute nodes, communication is challenging to orchestrate: this is a bottleneck to further scaling. In this work, we argue that alternative training methods can mitigate these issues, and can inform the design of extreme-scale training hardware. Indeed, using a synaptically asymmetric method with a parallelizable backward pass, such as Direct Feedback Alignement, communication needs are drastically reduced. We present a photonic accelerator for Direct Feedback Alignment, able to compute random projections with trillions of parameters. We demonstrate our system on benchmark tasks, using both fully-connected and graph convolutional networks. Our hardware is the first architecture-agnostic photonic co-processor for training neural networks. This is a significant step towards building scalable hardware, able to go beyond backpropagation, and opening new avenues for deep learning.

Comments:	6 pages, 2 figures, 1 table. Oral at the Beyond Backpropagation Workshop, NeurIPS 2020
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
Cite as:	arXiv:2012.06373 [cs.LG]
	(or arXiv:2012.06373v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2012.06373

Submission history

From: Julien Launay [view email]
[v1] Fri, 11 Dec 2020 14:20:45 UTC (456 KB)

Computer Science > Machine Learning

Title:Hardware Beyond Backpropagation: a Photonic Co-Processor for Direct Feedback Alignment

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Hardware Beyond Backpropagation: a Photonic Co-Processor for Direct Feedback Alignment

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators