Autocorrelation Matters: Understanding the Role of Initialization Schemes for State Space Models

Liu, Fusheng; Li, Qianxiao

Computer Science > Machine Learning

arXiv:2411.19455 (cs)

[Submitted on 29 Nov 2024]

Abstract: Current methods for initializing state space model (SSM) parameters primarily rely on the HiPPO framework (Gu et al., 2023), which is based on online function approximation with the SSM kernel basis. However, the HiPPO framework does not explicitly account for the effects of the temporal structures of input sequences on the optimization of SSMs. In this paper, we take a further step to investigate the roles of SSM initialization schemes by considering the autocorrelation of input sequences. Specifically, we: (1) rigorously characterize the dependency of the SSM timescale on sequence length based on sequence autocorrelation; (2) find that with a proper timescale, allowing a zero real part for the eigenvalues of the SSM state matrix mitigates the curse of memory while still maintaining stability at initialization; (3) show that the imaginary part of the eigenvalues of the SSM state matrix determines the conditioning of SSM optimization problems, and uncover an approximation-estimation tradeoff when training SSMs with a specific class of target functions.
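The terms used in the abstract can be made concrete with a minimal sketch. It is not the paper's construction: it assumes a single diagonal SSM mode with kernel exp(eigenvalue * dt * j), unit input and output weights, and a timescale dt = 1/length chosen purely for illustration. The helper names `ssm_kernel` and `autocorrelation` are hypothetical. It only shows two elementary facts: a mode whose eigenvalue has zero real part does not decay over the sequence, while a negative real part forgets early inputs, and slowly varying sequences have lag-1 autocorrelation near 1.

```python
import cmath
import math


def ssm_kernel(eigenvalue, dt, length):
    """Kernel k[j] = exp(eigenvalue * dt * j) of one diagonal SSM mode.

    Input and output weights are fixed to 1 (a simplifying assumption).
    """
    return [cmath.exp(eigenvalue * dt * j) for j in range(length)]


def autocorrelation(x, lag):
    """Biased sample autocorrelation of x at `lag`, normalized by lag 0."""
    n = len(x)
    mean = sum(x) / n
    centered = [v - mean for v in x]
    var = sum(c * c for c in centered)
    cov = sum(centered[t] * centered[t + lag] for t in range(n - lag))
    return cov / var


length = 1000
dt = 1.0 / length  # illustrative assumption: timescale shrinks with length

# Negative real part: the kernel magnitude decays along the sequence.
decaying = ssm_kernel(complex(-50.0, 10.0), dt, length)
# Zero real part: the kernel only rotates, so its magnitude stays at 1.
oscillating = ssm_kernel(complex(0.0, 10.0), dt, length)

# A slowly varying input versus a rapidly alternating one.
smooth = [math.sin(2 * math.pi * t / length) for t in range(length)]
alternating = [1.0 if t % 2 == 0 else -1.0 for t in range(length)]

print("last kernel magnitude, negative real part:", abs(decaying[-1]))
print("last kernel magnitude, zero real part:", abs(oscillating[-1]))
print("lag-1 autocorrelation, smooth input:", autocorrelation(smooth, 1))
print("lag-1 autocorrelation, alternating input:", autocorrelation(alternating, 1))
```

The eigenvalues and the 1/length timescale here are arbitrary illustrative values; how the timescale should actually depend on sequence length and autocorrelation is the subject of the paper's analysis and is not reproduced by this sketch.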

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2411.19455 [cs.LG]
	(or arXiv:2411.19455v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2411.19455

Submission history

From: Fusheng Liu
[v1] Fri, 29 Nov 2024 03:55:19 UTC (8,206 KB)
