PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation Metrics

Larionov, Daniil; Eger, Steffen

Computer Science > Computation and Language

arXiv:2412.16120 (cs)

[Submitted on 20 Dec 2024]

Title:PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation Metrics

Authors:Daniil Larionov, Steffen Eger

View PDF HTML (experimental)

Abstract:Evaluating the quality of machine-generated natural language content is a challenging task in Natural Language Processing (NLP). Recently, large language models (LLMs) like GPT-4 have been employed for this purpose, but they are computationally expensive due to the extensive token usage required by complex evaluation prompts. In this paper, we propose a prompt optimization approach that uses a smaller, fine-tuned language model to compress input data for evaluation prompt, thus reducing token usage and computational cost when using larger LLMs for downstream evaluation. Our method involves a two-stage fine-tuning process: supervised fine-tuning followed by preference optimization to refine the model's outputs based on human preferences. We focus on Machine Translation (MT) evaluation and utilize the GEMBA-MQM metric as a starting point. Our results show a $2.37\times$ reduction in token usage without any loss in evaluation quality. This work makes state-of-the-art LLM-based metrics like GEMBA-MQM more cost-effective and efficient, enhancing their accessibility for broader use.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2412.16120 [cs.CL]
	(or arXiv:2412.16120v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2412.16120

Submission history

From: Daniil Larionov [view email]
[v1] Fri, 20 Dec 2024 18:08:02 UTC (8,885 KB)

Computer Science > Computation and Language

Title:PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation Metrics

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation Metrics

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators