Consistency Matters: Explore LLMs Consistency From a Black-Box Perspective

Zhao, Fufangchen; Jin, Guoqiang; Huang, Jiaheng; Zhao, Rui; Tan, Fei

arXiv:2402.17411 (cs)

This paper has been withdrawn by Fufangchen Zhao

[Submitted on 27 Feb 2024 (v1), last revised 2 Mar 2024 (this version, v2)]

Abstract: Both commercial and open-source academic LLMs have become the mainstream models in NLP. However, there is still little research on LLM consistency, that is, the requirement that a model's internal parameters and capabilities remain unchanged throughout the various stages of LLM research and deployment. This issue arises in both industry and academia. Solving it is often time-consuming and labor-intensive, and it can incur the additional cost of a secondary deployment, resulting in economic and time losses. To fill this gap, we build an LLM consistency task dataset and design several baselines. We also choose models of diverse scales for the main experiments. In the LightGBM experiment specifically, we use traditional NLG metrics (i.e., ROUGE, BLEU, METEOR) as the features for model training. The final result exceeds manual evaluation, GPT3.5, and the other models in the main experiment, achieving the best performance. Finally, we use the best-performing LightGBM model as the base model for an evaluation tool that can effectively assist in the deployment of business models. Our code and tool demo are available at this https URL
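
The abstract does not say how the metric features are computed, so the following is only a minimal sketch of the general idea: turn a pair of model responses into a small vector of overlap scores that a gradient-boosted classifier such as LightGBM could consume. The function names, the whitespace tokenization, and the choice of four features are assumptions made for illustration. The scores are simplified stand-ins for BLEU and ROUGE (clipped n-gram precision, n-gram recall, and an LCS-based F1). METEOR and the classifier training step are omitted.

```python
from collections import Counter


def _ngrams(tokens, n):
    """Count the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def ngram_precision(candidate, reference, n=1):
    """Clipped n-gram precision, the core ingredient of BLEU (no brevity penalty)."""
    cand, ref = _ngrams(candidate, n), _ngrams(reference, n)
    total = sum(cand.values())
    if total == 0:
        return 0.0
    overlap = sum(min(count, ref[gram]) for gram, count in cand.items())
    return overlap / total


def rouge_n_recall(candidate, reference, n=1):
    """ROUGE-N style recall: share of reference n-grams found in the candidate."""
    cand, ref = _ngrams(candidate, n), _ngrams(reference, n)
    total = sum(ref.values())
    if total == 0:
        return 0.0
    overlap = sum(min(count, cand[gram]) for gram, count in ref.items())
    return overlap / total


def lcs_length(a, b):
    """Length of the longest common subsequence, via row-by-row dynamic programming."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_f1(candidate, reference):
    """ROUGE-L style F1 computed from the longest common subsequence."""
    lcs = lcs_length(candidate, reference)
    if lcs == 0:
        return 0.0
    precision = lcs / len(candidate)
    recall = lcs / len(reference)
    return 2 * precision * recall / (precision + recall)


def consistency_features(response_a, response_b):
    """Hypothetical feature vector for one pair of responses (assumed layout)."""
    a, b = response_a.lower().split(), response_b.lower().split()
    return [
        ngram_precision(a, b, 1),
        ngram_precision(a, b, 2),
        rouge_n_recall(a, b, 1),
        rouge_l_f1(a, b),
    ]
```

In a pipeline of this kind, one such vector would be computed per response pair, and the vectors with their consistency labels would then be passed to a classifier such as `lightgbm.LGBMClassifier`. Whether the withdrawn paper structured its features this way is not stated in the abstract.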

Comments:	This paper is not ready
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2402.17411 [cs.CL]
	(or arXiv:2402.17411v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2402.17411

Submission history

From: Fufangchen Zhao
[v1] Tue, 27 Feb 2024 11:02:12 UTC (974 KB)
[v2] Sat, 2 Mar 2024 14:08:06 UTC (1 KB) (withdrawn)
