Investigating Language-Specific Calibration For Pruning Multilingual Large Language Models

Kurz, Simon; Chen, Jian-Jia; Flek, Lucie; Zhao, Zhixue

arXiv:2408.14398 (cs)

[Submitted on 26 Aug 2024 (v1), last revised 30 Oct 2024 (this version, v3)]

Abstract: Recent advances in large language model (LLM) pruning have shown state-of-the-art (SotA) compression results in post-training and retraining-free settings while maintaining high predictive performance. However, previous research has mainly considered calibration on English text, despite the multilingual nature of modern LLMs and their frequent use in non-English languages. In this paper, we investigate how to calibrate the pruning of multilingual language models for monolingual applications. We present the first comprehensive empirical study comparing different calibration languages for pruning multilingual models across diverse languages, tasks, models, and SotA pruning techniques. Our results offer practical suggestions: for example, calibrating in the target language can efficiently retain language modeling capability but does not necessarily benefit downstream tasks. Through further analysis of latent subspaces, pruning masks, and individual neurons within pruned models, we find that while pruning generally preserves strong language-specific features, it may fail to retain language-specific neuron activation patterns and subtle, language-agnostic features associated with knowledge and reasoning that are needed for complex tasks.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2408.14398 [cs.CL]
	(or arXiv:2408.14398v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2408.14398

Submission history

From: Simon Kurz
[v1] Mon, 26 Aug 2024 16:29:13 UTC (9,205 KB)
[v2] Wed, 28 Aug 2024 12:03:54 UTC (9,205 KB)
[v3] Wed, 30 Oct 2024 00:53:43 UTC (9,672 KB)
