Dynamic Tensor Rematerialization

Kirisame, Marisa; Lyubomirsky, Steven; Haan, Altan; Brennan, Jennifer; He, Mike; Roesch, Jared; Chen, Tianqi; Tatlock, Zachary

Computer Science > Machine Learning

arXiv:2006.09616v1 (cs)

[Submitted on 17 Jun 2020 (this version), latest version 18 Mar 2021 (v4)]

Title:Dynamic Tensor Rematerialization

Authors:Marisa Kirisame, Steven Lyubomirsky, Altan Haan, Jennifer Brennan, Mike He, Jared Roesch, Tianqi Chen, Zachary Tatlock

View PDF

Abstract:Checkpointing enables training larger models by freeing intermediate activations and recomputing them on demand. Previous checkpointing techniques are difficult to generalize to dynamic models because they statically plan recomputations offline. We present Dynamic Tensor Rematerialization (DTR), a greedy online algorithm for heuristically checkpointing arbitrary models. DTR is extensible and general: it is parameterized by an eviction policy and only collects lightweight metadata on tensors and operators. Though DTR has no advance knowledge of the model or training task, we prove it can train an $N$-layer feedforward network on an $\Omega(\sqrt{N})$ memory budget with only $\mathcal{O}(N)$ tensor operations. Moreover, we identify a general eviction heuristic and show how it allows DTR to automatically provide favorable checkpointing performance across a variety of models and memory budgets.

Comments:	28 pages, 11 figures, implementation available here: this https URL
Subjects:	Machine Learning (cs.LG); Programming Languages (cs.PL); Machine Learning (stat.ML)
ACM classes:	C.3
Cite as:	arXiv:2006.09616 [cs.LG]
	(or arXiv:2006.09616v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2006.09616

Submission history

From: Steven Lyubomirsky [view email]
[v1] Wed, 17 Jun 2020 02:49:59 UTC (1,001 KB)
[v2] Thu, 18 Jun 2020 03:02:50 UTC (1,001 KB)
[v3] Mon, 12 Oct 2020 21:32:29 UTC (1,507 KB)
[v4] Thu, 18 Mar 2021 06:20:23 UTC (1,589 KB)

Computer Science > Machine Learning

Title:Dynamic Tensor Rematerialization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Dynamic Tensor Rematerialization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators