Improving Consistency in Large Language Models through Chain of Guidance

Raj, Harsh; Gupta, Vipul; Rosati, Domenic; Majumdar, Subhabrata

arXiv:2502.15924 (cs)

[Submitted on 21 Feb 2025]

Abstract: Consistency is a fundamental dimension of trustworthiness in Large Language Models (LLMs). For humans to be able to trust LLM-based applications, their outputs should be consistent when prompted with inputs that carry the same meaning or intent. Despite this need, there is no known mechanism to control and guide LLMs to be more consistent at inference time. In this paper, we introduce a novel alignment strategy to maximize semantic consistency in LLM outputs. Our proposal is based on Chain of Guidance (CoG), a multistep prompting technique that generates highly consistent outputs from LLMs. For closed-book question-answering (Q&A) tasks, outputs generated using CoG are more consistent than those from direct prompting. While other approaches like template-based responses and majority voting may offer alternative paths to consistency, our work focuses on exploring the potential of guided prompting. We use synthetic datasets composed of consistent input-output pairs to fine-tune LLMs to produce consistent and correct outputs. Our fine-tuned models are more than twice as consistent as base models and show strong generalization capabilities by producing consistent outputs over datasets not used in the fine-tuning process.
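The abstract's notion of consistency (the same answer for inputs that carry the same meaning) can be illustrated with a small sketch. This is not the paper's metric or code: the function names are hypothetical, and the agreement check is a normalized exact match standing in for whatever semantic comparison one would actually use. The score is the fraction of output pairs that agree, where the outputs are assumed to come from paraphrases of one question.

```python
from itertools import combinations
from typing import Callable, Sequence


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial formatting differences are ignored."""
    return " ".join(text.lower().split())


def exact_match(a: str, b: str) -> bool:
    # Assumption: normalized string equality stands in for a real
    # semantic-equivalence check (e.g. entailment or paraphrase detection).
    return normalize(a) == normalize(b)


def pairwise_consistency(
    outputs: Sequence[str],
    agree: Callable[[str, str], bool] = exact_match,
) -> float:
    """Fraction of unordered output pairs that agree under `agree`."""
    pairs = list(combinations(outputs, 2))
    if not pairs:
        # Zero or one output: nothing to disagree with.
        return 1.0
    return sum(agree(a, b) for a, b in pairs) / len(pairs)


# Hypothetical outputs for four paraphrases of the same question.
outputs = ["Paris", "paris", "The capital is Paris", "Paris"]
print(pairwise_consistency(outputs))
```

With exact matching, the third answer counts as a disagreement even though it means the same thing, which is why a semantic agreement function would normally be passed in as `agree`.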

Comments:	Accepted at Transactions on Machine Learning Research (TMLR) 2025
Subjects:	Computation and Language (cs.CL)
ACM classes:	I.2.6; I.5.1
Cite as:	arXiv:2502.15924 [cs.CL]
	(or arXiv:2502.15924v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.15924

Submission history

From: Harsh Raj
[v1] Fri, 21 Feb 2025 20:41:37 UTC (773 KB)
