Chunk-Distilled Language Modeling

Li, Yanhong; Livescu, Karen; Zhou, Jiawei

Computer Science > Computation and Language

arXiv:2501.00343 (cs)

[Submitted on 31 Dec 2024]

Title:Chunk-Distilled Language Modeling

Authors:Yanhong Li, Karen Livescu, Jiawei Zhou

View PDF HTML (experimental)

Abstract:We introduce Chunk-Distilled Language Modeling (CD-LM), an approach to text generation that addresses two challenges in current large language models (LLMs): the inefficiency of token-level generation, and the difficulty of adapting to new data and knowledge. Our method combines deep network-based LLMs with a straightforward retrieval module, which allows the generation of multi-token text chunks at a single decoding step. Our retrieval framework enables flexible construction of model- or domain-specific datastores, either leveraging the internal knowledge of existing models, or incorporating expert insights from human-annotated corpora. This adaptability allows for enhanced control over the language model's distribution without necessitating additional training. We present the CD-LM formulation along with performance metrics demonstrating its ability to improve language model performance and efficiency across a diverse set of downstream tasks. Code and data will be made publicly available.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2501.00343 [cs.CL]
	(or arXiv:2501.00343v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2501.00343

Submission history

From: Yanhong Li [view email]
[v1] Tue, 31 Dec 2024 08:32:15 UTC (1,050 KB)

Computer Science > Computation and Language

Title:Chunk-Distilled Language Modeling

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Chunk-Distilled Language Modeling

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators