SWSC: Shared Weight for Similar Channel in LLM

Zeng, Binrui; Tang, Yongtao; Liu, Xiaodong; Li, Xiaopeng

Computer Science > Machine Learning

arXiv:2501.08631 (cs)

[Submitted on 15 Jan 2025]

Title:SWSC: Shared Weight for Similar Channel in LLM

Authors:Binrui Zeng, Yongtao Tang, Xiaodong Liu, Xiaopeng Li

View PDF HTML (experimental)

Abstract:Large language models (LLMs) have spurred development in multiple industries. However, the growing number of their parameters brings substantial storage and computing burdens, making it essential to explore model compression techniques for parameter reduction and easier deployment. We propose SWSC, an LLM compression method based on the concept of Shared Weight for Similar Channel. It uses the K-Means clustering algorithm to cluster model weights channel-by-channel, generating clusters with highly similar vectors within each. A representative vector from each cluster is selected to approximately replace all vectors in the cluster, significantly reducing the number of model weight parameters. However, approximate restoration will inevitably cause damage to the performance of the model. To tackle this issue, we perform singular value decomposition on the weight error values before and after compression and retain the larger singular values and their corresponding singular vectors to compensate for the accuracy. The experimental results show that our method can effectively ensure the performance of the compressed LLM even under low-precision conditions.

Comments:	5pages, 3 figures, work in progress
Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as:	arXiv:2501.08631 [cs.LG]
	(or arXiv:2501.08631v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2501.08631

Submission history

From: Binrui Zeng [view email]
[v1] Wed, 15 Jan 2025 07:36:19 UTC (213 KB)

Computer Science > Machine Learning

Title:SWSC: Shared Weight for Similar Channel in LLM

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:SWSC: Shared Weight for Similar Channel in LLM

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators