On Subjective Uncertainty Quantification and Calibration in Natural Language Generation

Wang, Ziyu; Holmes, Chris

Computer Science > Computation and Language

arXiv:2406.05213 (cs)

[Submitted on 7 Jun 2024]

Title:On Subjective Uncertainty Quantification and Calibration in Natural Language Generation

Authors:Ziyu Wang, Chris Holmes

View PDF HTML (experimental)

Abstract:Applications of large language models often involve the generation of free-form responses, in which case uncertainty quantification becomes challenging. This is due to the need to identify task-specific uncertainties (e.g., about the semantics) which appears difficult to define in general cases. This work addresses these challenges from a perspective of Bayesian decision theory, starting from the assumption that our utility is characterized by a similarity measure that compares a generated response with a hypothetical true response. We discuss how this assumption enables principled quantification of the model's subjective uncertainty and its calibration. We further derive a measure for epistemic uncertainty, based on a missing data perspective and its characterization as an excess risk. The proposed measures can be applied to black-box language models. We demonstrate the proposed methods on question answering and machine translation tasks, where they extract broadly meaningful uncertainty estimates from GPT and Gemini models and quantify their calibration.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2406.05213 [cs.CL]
	(or arXiv:2406.05213v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2406.05213

Submission history

From: Ziyu Wang [view email]
[v1] Fri, 7 Jun 2024 18:54:40 UTC (58 KB)

Computer Science > Computation and Language

Title:On Subjective Uncertainty Quantification and Calibration in Natural Language Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:On Subjective Uncertainty Quantification and Calibration in Natural Language Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators