Large Language Model Evaluation via Matrix Entropy

Wei, Lai; Tan, Zhiquan; Li, Chenghai; Wang, Jindong; Huang, Weiran

Computer Science > Machine Learning

arXiv:2401.17139v1 (cs)

[Submitted on 30 Jan 2024 (this version), latest version 14 Oct 2024 (v2)]

Title:Large Language Model Evaluation via Matrix Entropy

Authors:Lai Wei, Zhiquan Tan, Chenghai Li, Jindong Wang, Weiran Huang

View PDF

Abstract:Large language models (LLMs) have revolutionized the field of natural language processing, extending their strong capabilities into multi-modal domains. Thus, it is vital to define proper and diversified metrics for the evaluation of LLMs.
In this paper, we introduce matrix entropy, a novel metric rooted in information theory and geometry principles to quantify the data compression proficiency in LLMs. It reflects the model's ability to extract relevant information and eliminate unnecessary elements, thereby providing insight into the language model's intrinsic capability. Specifically, we demonstrate its applicability in both single-modal (language) and multi-modal settings. For language models, our findings reveal that the matrix entropy of representations follows a scaling law type reduction when the model scales up, serving as a complement to the traditional loss scaling law. For the multi-modal setting, we also propose an evaluation method based on matrix entropy for assessing alignment quality and we find that modern large multi-modal models exhibit great alignment performance.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Theory (cs.IT)
Cite as:	arXiv:2401.17139 [cs.LG]
	(or arXiv:2401.17139v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2401.17139

Submission history

From: Lai Wei [view email]
[v1] Tue, 30 Jan 2024 16:19:55 UTC (197 KB)
[v2] Mon, 14 Oct 2024 04:36:09 UTC (166 KB)

Computer Science > Machine Learning

Title:Large Language Model Evaluation via Matrix Entropy

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Large Language Model Evaluation via Matrix Entropy

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators