MATEY: multiscale adaptive foundation models for spatiotemporal physical systems

Zhang, Pei; Laiu, M. Paul; Norman, Matthew; Stefanski, Doug; Gounley, John

Computer Science > Machine Learning

arXiv:2412.20601 (cs)

[Submitted on 29 Dec 2024]

Title:MATEY: multiscale adaptive foundation models for spatiotemporal physical systems

Authors:Pei Zhang, M. Paul Laiu, Matthew Norman, Doug Stefanski, John Gounley

View PDF HTML (experimental)

Abstract:Accurate representation of the multiscale features in spatiotemporal physical systems using vision transformer (ViT) architectures requires extremely long, computationally prohibitive token sequences. To address this issue, we propose two adaptive tokenization schemes that dynamically adjust patch sizes based on local features: one ensures convergent behavior to uniform patch refinement, while the other offers better computational efficiency. Moreover, we present a set of spatiotemporal attention schemes, where the temporal or axial spatial dimensions are decoupled, and evaluate their computational and data efficiencies. We assess the performance of the proposed multiscale adaptive model, MATEY, in a sequence of experiments. The results show that adaptive tokenization schemes achieve improved accuracy without significantly increasing the length of the token sequence. Compared to a full spatiotemporal attention scheme or a scheme that decouples only the temporal dimension, we find that fully decoupled axial attention is less efficient and expressive, requiring more training time and model weights to achieve the same accuracy. Finally, we demonstrate in two fine-tuning tasks featuring different physics that models pretrained on PDEBench data outperform the ones trained from scratch, especially in the low data regime with frozen attention.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE)
Cite as:	arXiv:2412.20601 [cs.LG]
	(or arXiv:2412.20601v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2412.20601

Submission history

From: Pei Zhang [view email]
[v1] Sun, 29 Dec 2024 22:13:16 UTC (8,303 KB)

Computer Science > Machine Learning

Title:MATEY: multiscale adaptive foundation models for spatiotemporal physical systems

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:MATEY: multiscale adaptive foundation models for spatiotemporal physical systems

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators