Rough Transformers: Lightweight Continuous-Time Sequence Modelling with Path Signatures

Moreno-Pino, Fernando; Arroyo, Álvaro; Waldon, Harrison; Dong, Xiaowen; Cartea, Álvaro

Statistics > Machine Learning

arXiv:2405.20799 (stat)

[Submitted on 31 May 2024 (v1), last revised 28 Oct 2024 (this version, v2)]

Title:Rough Transformers: Lightweight Continuous-Time Sequence Modelling with Path Signatures

Authors:Fernando Moreno-Pino, Álvaro Arroyo, Harrison Waldon, Xiaowen Dong, Álvaro Cartea

View PDF

Abstract:Time-series data in real-world settings typically exhibit long-range dependencies and are observed at non-uniform intervals. In these settings, traditional sequence-based recurrent models struggle. To overcome this, researchers often replace recurrent architectures with Neural ODE-based models to account for irregularly sampled data and use Transformer-based architectures to account for long-range dependencies. Despite the success of these two approaches, both incur very high computational costs for input sequences of even moderate length. To address this challenge, we introduce the Rough Transformer, a variation of the Transformer model that operates on continuous-time representations of input sequences and incurs significantly lower computational costs. In particular, we propose multi-view signature attention, which uses path signatures to augment vanilla attention and to capture both local and global (multi-scale) dependencies in the input data, while remaining robust to changes in the sequence length and sampling frequency and yielding improved spatial processing. We find that, on a variety of time-series-related tasks, Rough Transformers consistently outperform their vanilla attention counterparts while obtaining the representational benefits of Neural ODE-based models, all at a fraction of the computational time and memory resources.

Comments:	NeurIPS 2024 Conference (Camera Ready Version)
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2405.20799 [stat.ML]
	(or arXiv:2405.20799v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2405.20799

Submission history

From: Fernando Moreno-Pino [view email]
[v1] Fri, 31 May 2024 14:00:44 UTC (1,304 KB)
[v2] Mon, 28 Oct 2024 16:22:24 UTC (2,072 KB)

Statistics > Machine Learning

Title:Rough Transformers: Lightweight Continuous-Time Sequence Modelling with Path Signatures

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Rough Transformers: Lightweight Continuous-Time Sequence Modelling with Path Signatures

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators