Evaluating Expert Contributions in a MoE LLM for Quiz-Based Tasks

Chernov, Andrei

arXiv:2502.17187 (cs)

[Submitted on 24 Feb 2025]

Abstract: Large Language Models (LLMs) with Mixture of Experts (MoE) layers have recently gained significant attention, and current state-of-the-art LLMs use this architecture. There is a substantial body of research on how to train such models and how to select hyperparameters for them. However, few studies focus on post-evaluation analysis of MoE layer properties. In this paper, we take a first step toward closing this gap by evaluating expert contributions on the quiz-based MMLU benchmark. We show that most experts were never activated during inference on this benchmark. Additionally, the output distribution of the gating networks is much closer to uniform than to sparse. Finally, we demonstrate that average performance varies significantly among some experts within the same layer.
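
The first two findings in the abstract (experts that are never activated, and gating outputs that are closer to uniform than to sparse) can be illustrated with a small sketch. This is not the paper's code: the logits are synthetic, and the names `expert_usage`, `normalized_entropy`, and the `top_k` value are assumptions chosen for illustration. It counts how often each expert is selected by top-k routing and measures the entropy of the gate's softmax output relative to a uniform distribution (1.0 means uniform, values near 0 mean sparse).

```python
import numpy as np


def softmax(logits, axis=-1):
    """Numerically stable softmax."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def expert_usage(gate_logits, top_k):
    """Count how often each expert is among the top-k choices over all tokens.

    gate_logits: array of shape (num_tokens, num_experts).
    """
    num_experts = gate_logits.shape[-1]
    top_idx = np.argsort(gate_logits, axis=-1)[:, -top_k:]
    return np.bincount(top_idx.ravel(), minlength=num_experts)


def normalized_entropy(gate_logits):
    """Per-token entropy of the gate distribution divided by log(num_experts)."""
    probs = softmax(gate_logits)
    entropy = -(probs * np.log(probs + 1e-12)).sum(axis=-1)
    return entropy / np.log(gate_logits.shape[-1])


# Synthetic gate logits (assumption): 1000 tokens, 8 experts.
rng = np.random.default_rng(0)
gate_logits = rng.normal(size=(1000, 8))
gate_logits[:, 6:] -= 20.0  # make the last two experts effectively unreachable

counts = expert_usage(gate_logits, top_k=2)
never_used = np.flatnonzero(counts == 0)
mean_entropy = normalized_entropy(gate_logits).mean()

print("selection counts per expert:", counts)
print("experts never activated:", never_used)
print("mean normalized gate entropy:", mean_entropy)
```

In a real analysis the logits would come from the router of each MoE layer while the model processes the benchmark prompts; how to extract them depends on the model implementation and is not covered here.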

Comments:	preprint, short paper
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2502.17187 [cs.CL]
	(or arXiv:2502.17187v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.17187

Submission history

From: Andrei Chernov
[v1] Mon, 24 Feb 2025 14:23:52 UTC (26 KB)
