Data Laundering: Artificially Boosting Benchmark Results through Knowledge Distillation

Mansurov, Jonibek; Sakip, Akhmed; Aji, Alham Fikri

arXiv:2412.15255 (cs)

[Submitted on 15 Dec 2024]

Abstract: In this paper, we show that knowledge distillation can be subverted to manipulate language model benchmark scores, revealing a critical vulnerability in current evaluation practices. We introduce "Data Laundering," a three-phase process analogous to financial money laundering that enables the covert transfer of benchmark-specific knowledge through seemingly legitimate intermediate training steps. Through extensive experiments with a 2-layer BERT student model, we show that this approach can achieve substantial improvements in benchmark accuracy (up to 75% on GPQA) without developing genuine reasoning capabilities. Notably, the method can be exploited intentionally or unintentionally: researchers may adopt a knowledge distillation setup that inflates scores without realizing the implications. While our findings demonstrate the effectiveness of this technique, we present them as a cautionary tale highlighting the urgent need for more robust evaluation methods in AI. This work aims to contribute to the ongoing discussion about evaluation integrity in AI development and the need for benchmarks that more accurately reflect true model capabilities. The code is available at this https URL.
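The abstract does not spell out the distillation objective, so the following is only a generic sketch of the standard soft-label distillation loss (a temperature-scaled KL divergence between teacher and student output distributions). It is not the authors' released code; the function names, the temperature value, and the example logits are assumptions chosen for illustration. It shows why distillation can act as a transfer channel: the student is trained only against the teacher's output distributions, so whatever the teacher has learned can be passed on without the student seeing the teacher's training data.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened distributions.

    The T^2 factor is the conventional rescaling that keeps gradient
    magnitudes comparable across temperatures. The default temperature
    of 2.0 is an illustrative assumption, not a value from the paper.
    """
    teacher_probs = softmax(teacher_logits, temperature)
    student_probs = softmax(student_logits, temperature)
    kl = sum(
        p * math.log(p / q)
        for p, q in zip(teacher_probs, student_probs)
        if p > 0.0
    )
    return temperature ** 2 * kl


if __name__ == "__main__":
    # Hypothetical logits for a 4-option multiple-choice question.
    teacher = [4.0, 0.5, 0.2, 0.1]   # teacher is confident in option 0
    student = [1.0, 1.0, 1.0, 1.0]   # untrained student is uniform
    print(distillation_loss(student, teacher))
```

Minimizing this loss pulls the student's distribution toward the teacher's on whatever inputs are used for distillation; the loss is zero when the two sets of logits are identical.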

Comments:	14 pages
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2412.15255 [cs.CL]
	(or arXiv:2412.15255v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2412.15255

Submission history

From: Jonibek Mansurov
[v1] Sun, 15 Dec 2024 19:38:48 UTC (2,978 KB)
