Vocabulary-level Memory Efficiency for Language Model Fine-tuning

Williams, Miles; Aletras, Nikolaos

Computer Science > Computation and Language

arXiv:2309.08708 (cs)

[Submitted on 15 Sep 2023 (v1), last revised 25 Mar 2025 (this version, v2)]

Title:Vocabulary-level Memory Efficiency for Language Model Fine-tuning

Authors:Miles Williams, Nikolaos Aletras

View PDF HTML (experimental)

Abstract:The extensive memory footprint of language model (LM) fine-tuning poses a challenge for both researchers and practitioners. LMs use an embedding matrix to represent extensive vocabularies, forming a substantial proportion of the model parameters. While previous work towards memory-efficient fine-tuning has focused on minimizing the number of trainable parameters, reducing the memory footprint of the embedding matrix has yet to be explored. We first demonstrate that a significant proportion of the vocabulary remains unused during fine-tuning. We then propose a simple yet effective approach that leverages this finding to minimize memory usage. We show that our approach provides substantial reductions in memory usage across a wide range of models and tasks. Notably, our approach does not impact downstream task performance, while allowing more efficient use of computational resources.

Comments:	RepL4NLP 2025
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2309.08708 [cs.CL]
	(or arXiv:2309.08708v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2309.08708

Submission history

From: Miles Williams [view email]
[v1] Fri, 15 Sep 2023 19:00:00 UTC (7,679 KB)
[v2] Tue, 25 Mar 2025 13:30:00 UTC (115 KB)

Computer Science > Computation and Language

Title:Vocabulary-level Memory Efficiency for Language Model Fine-tuning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Vocabulary-level Memory Efficiency for Language Model Fine-tuning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators