Fast Training of Recurrent Neural Networks with Stationary State Feedbacks

Caillon, Paul; Fagnou, Erwan; Allauzen, Alexandre

Computer Science > Machine Learning

arXiv:2503.23104 (cs)

[Submitted on 29 Mar 2025]


Authors: Paul Caillon (1), Erwan Fagnou (1), Alexandre Allauzen (1 and 2) ((1) Miles Team, LAMSADE, Université Paris Dauphine - PSL, Paris, France, (2) ESPCI PSL, Paris, France)


Abstract: Recurrent neural networks (RNNs) have recently demonstrated strong performance and faster inference than Transformers at comparable parameter budgets. However, the recursive gradient computation of the backpropagation through time (BPTT) algorithm remains the major computational bottleneck. In this work, we propose a novel method that replaces BPTT with a fixed gradient feedback mechanism, yielding an efficient approximation of exact gradient propagation under the assumption of time stationarity. Our approach leverages state-space model (SSM) principles to define a structured feedback matrix that directly propagates gradients from future time steps. This formulation bypasses recursive gradient backpropagation, significantly reducing training overhead while preserving the network's ability to capture long-term dependencies. Experiments on language modeling benchmarks show competitive perplexity scores at significantly reduced training cost. These promising results suggest that designing a feedback method like an SSM can fully exploit the efficiency advantages of RNNs for many practical applications.
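The abstract does not spell out the update rule, so the following is only an illustrative sketch of the general idea, not the authors' implementation. It assumes that the time-varying Jacobian in the BPTT recursion is replaced by one fixed diagonal feedback vector `a` (a simplifying assumption made here), so that the state gradient obeys delta_t = g_t + a * delta_{t+1}, where `g` holds the per-step local loss gradients. Under that assumption the recursion is linear and time-invariant, much like an SSM run backward in time, and it unrolls into an explicit weighted sum over future steps. The function names `feedback_recurrent` and `feedback_unrolled` and all numeric values are hypothetical.

```python
import numpy as np


def feedback_recurrent(g, a):
    """Backward scan: delta_t = g_t + a * delta_{t+1}, with delta_T = 0.

    g: (T, d) array of local gradients dL_t/dh_t (illustrative input).
    a: (d,) fixed diagonal feedback, an assumption of this sketch.
    """
    T, d = g.shape
    delta = np.zeros_like(g)
    carry = np.zeros(d)
    for t in range(T - 1, -1, -1):
        carry = g[t] + a * carry
        delta[t] = carry
    return delta


def feedback_unrolled(g, a):
    """Same quantity as an explicit sum: delta_t = sum_k a**k * g_{t+k}.

    Each time step depends only on g and fixed powers of a, so no step
    needs the result of another step.
    """
    T, _ = g.shape
    lags = np.arange(T)
    powers = a[None, :] ** lags[:, None]  # (T, d): a**k for every lag k
    delta = np.zeros_like(g)
    for t in range(T):
        delta[t] = (powers[: T - t] * g[t:]).sum(axis=0)
    return delta


rng = np.random.default_rng(0)
g = rng.normal(size=(6, 3))        # toy local gradients, T=6, d=3
a = np.array([0.9, 0.5, 0.1])      # toy feedback values
d_rec = feedback_recurrent(g, a)
d_unr = feedback_unrolled(g, a)
print(np.allclose(d_rec, d_unr))
```

Because the feedback is fixed, the two functions compute the same gradients; the unrolled form shows why the time steps no longer have to be processed one after another, which is the property a stationary feedback is meant to exploit.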

Comments:	18 pages (including additional contents), 3 figures, 5 tables, code available at this https URL
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2503.23104 [cs.LG]
	(or arXiv:2503.23104v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2503.23104

Submission history

From: Paul Caillon
[v1] Sat, 29 Mar 2025 14:45:52 UTC (668 KB)
