ConU: Conformal Uncertainty in Large Language Models with Correctness Coverage Guarantees

Wang, Zhiyuan; Duan, Jinhao; Cheng, Lu; Zhang, Yue; Wang, Qingni; Shi, Xiaoshuang; Xu, Kaidi; Shen, Hengtao; Zhu, Xiaofeng


arXiv:2407.00499 (cs)

[Submitted on 29 Jun 2024 (v1), last revised 18 Nov 2024 (this version, v3)]

Abstract: Uncertainty quantification (UQ) in natural language generation (NLG) tasks remains an open challenge, exacerbated by the closed-source nature of the latest large language models (LLMs). This study investigates applying conformal prediction (CP), which can transform any heuristic uncertainty notion into rigorous prediction sets, to black-box LLMs in open-ended NLG tasks. We introduce a novel uncertainty measure based on self-consistency theory, and then develop a conformal uncertainty criterion by integrating the uncertainty condition aligned with correctness into the CP algorithm. Empirical evaluations indicate that our uncertainty measure outperforms prior state-of-the-art methods. Furthermore, we achieve strict control over the correctness coverage rate using 7 popular LLMs on 4 free-form NLG datasets, spanning general-purpose and medical scenarios. Additionally, the small size of the calibrated prediction sets further highlights the efficiency of our method in providing trustworthy guarantees for practical open-ended NLG applications.
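
For readers unfamiliar with conformal prediction, the following is a minimal sketch of the generic split-conformal recipe that the abstract alludes to: calibrate a threshold on held-out nonconformity scores, then keep every candidate answer whose score falls within it. This is not the ConU criterion from the paper. The function names, the scores, and the candidate answers are all hypothetical, and the nonconformity score is assumed to be any heuristic uncertainty value where lower means more plausible.

```python
import math


def conformal_threshold(cal_scores, alpha):
    """Split-conformal quantile of calibration nonconformity scores.

    cal_scores: scores of the known-correct answers on a held-out
                calibration set (assumed exchangeable with test data).
    alpha:      target error rate; the goal is coverage >= 1 - alpha.
    """
    n = len(cal_scores)
    if n == 0:
        raise ValueError("need at least one calibration score")
    # Finite-sample corrected rank: ceil((n + 1) * (1 - alpha)).
    rank = math.ceil((n + 1) * (1 - alpha))
    if rank > n:
        # Too few calibration points for this alpha: keep every candidate.
        return math.inf
    return sorted(cal_scores)[rank - 1]


def prediction_set(candidate_scores, threshold):
    """Keep each candidate whose nonconformity score is within the threshold."""
    return [c for c, s in candidate_scores.items() if s <= threshold]


# Hypothetical calibration scores (illustrative values, not from the paper).
cal_scores = [0.05, 0.10, 0.20, 0.30, 0.45, 0.60, 0.90]
threshold = conformal_threshold(cal_scores, alpha=0.25)

# Hypothetical sampled answers for one test question, with their scores.
candidates = {"Paris": 0.10, "Lyon": 0.70, "Marseille": 0.55}
kept = prediction_set(candidates, threshold)
```

With seven calibration scores and `alpha=0.25`, the corrected rank is 6, so the threshold is the sixth-smallest calibration score. Under the exchangeability assumption, sets built this way contain the correct answer with probability at least `1 - alpha`, provided the correct answer is among the scored candidates.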

Comments:	Accepted by EMNLP 2024 Findings
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2407.00499 [cs.CL]
	(or arXiv:2407.00499v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2407.00499

Submission history

From: Zhiyuan Wang
[v1] Sat, 29 Jun 2024 17:33:07 UTC (4,518 KB)
[v2] Sun, 20 Oct 2024 04:17:20 UTC (5,078 KB)
[v3] Mon, 18 Nov 2024 08:33:35 UTC (5,079 KB)
