Slot State Space Models

Jiang, Jindong; Deng, Fei; Singh, Gautam; Lee, Minseung; Ahn, Sungjin

Computer Science > Artificial Intelligence

arXiv:2406.12272 (cs)

[Submitted on 18 Jun 2024 (v1), last revised 29 Nov 2024 (this version, v6)]

Title:Slot State Space Models

Authors:Jindong Jiang, Fei Deng, Gautam Singh, Minseung Lee, Sungjin Ahn

View PDF HTML (experimental)

Abstract:Recent State Space Models (SSMs) such as S4, S5, and Mamba have shown remarkable computational benefits in long-range temporal dependency modeling. However, in many sequence modeling problems, the underlying process is inherently modular and it is of interest to have inductive biases that mimic this modular structure. In this paper, we introduce SlotSSMs, a novel framework for incorporating independent mechanisms into SSMs to preserve or encourage separation of information. Unlike conventional SSMs that maintain a monolithic state vector, SlotSSMs maintains the state as a collection of multiple vectors called slots. Crucially, the state transitions are performed independently per slot with sparse interactions across slots implemented via the bottleneck of self-attention. In experiments, we evaluate our model in object-centric learning, 3D visual reasoning, and long-context video understanding tasks, which involve modeling multiple objects and their long-range temporal dependencies. We find that our proposed design offers substantial performance gains over existing sequence modeling methods. Project page is available at this https URL

Comments:	Accepted to NeurIPS 2024; Project page is available at this https URL ; Code is available at this https URL
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2406.12272 [cs.AI]
	(or arXiv:2406.12272v6 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2406.12272

Submission history

From: Jindong Jiang [view email]
[v1] Tue, 18 Jun 2024 04:59:14 UTC (4,714 KB)
[v2] Wed, 19 Jun 2024 22:53:36 UTC (4,716 KB)
[v3] Wed, 26 Jun 2024 03:04:04 UTC (4,823 KB)
[v4] Sun, 30 Jun 2024 22:25:01 UTC (4,823 KB)
[v5] Wed, 21 Aug 2024 20:54:33 UTC (4,823 KB)
[v6] Fri, 29 Nov 2024 21:23:51 UTC (11,672 KB)

Computer Science > Artificial Intelligence

Title:Slot State Space Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Slot State Space Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators