The Trifecta: Three simple techniques for training deeper Forward-Forward networks

Dooms, Thomas; Tsang, Ing Jyh; Oramas, Jose

Computer Science > Machine Learning

arXiv:2311.18130 (cs)

[Submitted on 29 Nov 2023 (v1), last revised 12 Dec 2023 (this version, v2)]

Title:The Trifecta: Three simple techniques for training deeper Forward-Forward networks

Authors:Thomas Dooms, Ing Jyh Tsang, Jose Oramas

View PDF HTML (experimental)

Abstract:Modern machine learning models are able to outperform humans on a variety of non-trivial tasks. However, as the complexity of the models increases, they consume significant amounts of power and still struggle to generalize effectively to unseen data. Local learning, which focuses on updating subsets of a model's parameters at a time, has emerged as a promising technique to address these issues. Recently, a novel local learning algorithm, called Forward-Forward, has received widespread attention due to its innovative approach to learning. Unfortunately, its application has been limited to smaller datasets due to scalability issues. To this end, we propose The Trifecta, a collection of three simple techniques that synergize exceptionally well and drastically improve the Forward-Forward algorithm on deeper networks. Our experiments demonstrate that our models are on par with similarly structured, backpropagation-based models in both training speed and test accuracy on simple datasets. This is achieved by the ability to learn representations that are informative locally, on a layer-by-layer basis, and retain their informativeness when propagated to deeper layers in the architecture. This leads to around 84% accuracy on CIFAR-10, a notable improvement (25%) over the original FF algorithm. These results highlight the potential of Forward-Forward as a genuine competitor to backpropagation and as a promising research avenue.

Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
MSC classes:	68T07
Cite as:	arXiv:2311.18130 [cs.LG]
	(or arXiv:2311.18130v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.18130

Submission history

From: Thomas Dooms [view email]
[v1] Wed, 29 Nov 2023 22:44:32 UTC (8,949 KB)
[v2] Tue, 12 Dec 2023 13:09:46 UTC (8,949 KB)

Computer Science > Machine Learning

Title:The Trifecta: Three simple techniques for training deeper Forward-Forward networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:The Trifecta: Three simple techniques for training deeper Forward-Forward networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators