HadamRNN: Binary and Sparse Ternary Orthogonal RNNs

Foucault, Armand; Mamalet, Franck; Malgouyres, François

arXiv:2502.00047 (cs)

[Submitted on 28 Jan 2025 (v1), last revised 5 Feb 2025 (this version, v2)]

Title: HadamRNN: Binary and Sparse Ternary Orthogonal RNNs

Authors: Armand Foucault (IMT, ANITI), Franck Mamalet (ANITI), François Malgouyres (IMT)

Abstract: Binary and sparse ternary weights in neural networks enable faster computations and lighter representations, facilitating their use on edge devices with limited computational power. Meanwhile, vanilla RNNs are highly sensitive to changes in their recurrent weights, making the binarization and ternarization of these weights inherently challenging. To date, no method has successfully achieved binarization or ternarization of vanilla RNN weights. We present a new approach leveraging the properties of Hadamard matrices to parameterize a subset of binary and sparse ternary orthogonal matrices. This method enables the training of orthogonal RNNs (ORNNs) with binary and sparse ternary recurrent weights, effectively creating a specific class of binary and sparse ternary vanilla RNNs. The resulting ORNNs, called HadamRNN and Block-HadamRNN, are evaluated on benchmarks such as the copy task, the permuted and sequential MNIST tasks, and the IMDB dataset. Despite binarization or sparse ternarization, these RNNs maintain performance levels comparable to state-of-the-art full-precision models, highlighting the effectiveness of our approach. Notably, our approach is the first solution with binary recurrent weights capable of tackling the copy task over 1000 timesteps.
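The abstract does not spell out the parameterization, so the following is only a minimal sketch of the underlying linear-algebra fact it relies on, not the authors' method. It assumes the Sylvester construction of Hadamard matrices and shows that (a) flipping the signs of columns keeps a matrix with entries in {-1, +1} orthogonal up to a scalar, and (b) placing Hadamard blocks on a block diagonal gives a sparse matrix with entries in {-1, 0, +1} that is also orthogonal up to a scalar. All function and variable names here are hypothetical.

```python
def sylvester_hadamard(k):
    """Return the 2**k x 2**k Sylvester Hadamard matrix as nested lists of +1/-1."""
    h = [[1]]
    for _ in range(k):
        top = [row + row for row in h]
        bottom = [row + [-x for x in row] for row in h]
        h = top + bottom
    return h


def flip_columns(m, signs):
    """Multiply column j of m by signs[j]; each sign is assumed to be +1 or -1."""
    return [[x * s for x, s in zip(row, signs)] for row in m]


def block_diag(a, b):
    """Place a and b on the diagonal of a larger matrix, zeros elsewhere."""
    zeros_right = [0] * len(b[0])
    zeros_left = [0] * len(a[0])
    return [row + zeros_right for row in a] + [zeros_left + row for row in b]


def gram_is_scaled_identity(m, scale):
    """Check that m @ m^T equals scale * I, i.e. m / sqrt(scale) is orthogonal."""
    n = len(m)
    for i in range(n):
        for j in range(n):
            dot = sum(x * y for x, y in zip(m[i], m[j]))
            if dot != (scale if i == j else 0):
                return False
    return True


# Binary case: an 8 x 8 Hadamard matrix with arbitrary column sign flips.
H8 = sylvester_hadamard(3)
signs = [1, -1, 1, 1, -1, -1, 1, -1]  # illustrative choice, not from the paper
W_binary = flip_columns(H8, signs)

# Sparse ternary case: two 4 x 4 Hadamard blocks on the diagonal of an 8 x 8 matrix.
H4 = sylvester_hadamard(2)
W_ternary = block_diag(H4, flip_columns(H4, [1, -1, -1, 1]))

print(gram_is_scaled_identity(W_binary, 8))   # W_binary / sqrt(8) is orthogonal
print(gram_is_scaled_identity(W_ternary, 4))  # W_ternary / sqrt(4) is orthogonal
```

In this sketch the only free choices are the sign vector and the block layout, which hints at why such matrices can stay exactly orthogonal while storing one bit (or one ternary value) per weight plus a single shared scale.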

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2502.00047 [cs.LG]
	(or arXiv:2502.00047v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2502.00047
Journal reference:	International Conference on Learning Representations (ICLR), Apr 2025, Singapore

Submission history

From: Franck MAMALET [via CCSD proxy]
[v1] Tue, 28 Jan 2025 09:16:28 UTC (292 KB)
[v2] Wed, 5 Feb 2025 08:22:28 UTC (292 KB)
