Benchmarking and Improving Generator-Validator Consistency of Language Models

Li, Xiang Lisa; Shrivastava, Vaishnavi; Li, Siyan; Hashimoto, Tatsunori; Liang, Percy

Computer Science > Computation and Language

arXiv:2310.01846 (cs)

[Submitted on 3 Oct 2023]

Title:Benchmarking and Improving Generator-Validator Consistency of Language Models

Authors:Xiang Lisa Li, Vaishnavi Shrivastava, Siyan Li, Tatsunori Hashimoto, Percy Liang

View PDF

Abstract:As of September 2023, ChatGPT correctly answers "what is 7+8" with 15, but when asked "7+8=15, True or False" it responds with "False". This inconsistency between generating and validating an answer is prevalent in language models (LMs) and erodes trust. In this paper, we propose a framework for measuring the consistency between generation and validation (which we call generator-validator consistency, or GV-consistency), finding that even GPT-4, a state-of-the-art LM, is GV-consistent only 76% of the time. To improve the consistency of LMs, we propose to finetune on the filtered generator and validator responses that are GV-consistent, and call this approach consistency fine-tuning. We find that this approach improves GV-consistency of Alpaca-30B from 60% to 93%, and the improvement extrapolates to unseen tasks and domains (e.g., GV-consistency for positive style transfers extrapolates to unseen styles like humor). In addition to improving consistency, consistency fine-tuning improves both generator quality and validator accuracy without using any labeled data. Evaluated across 6 tasks, including math questions, knowledge-intensive QA, and instruction following, our method improves the generator quality by 16% and the validator accuracy by 6.3% across all tasks.

Comments:	preprint
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2310.01846 [cs.CL]
	(or arXiv:2310.01846v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2310.01846

Submission history

From: Xiang Lisa Li [view email]
[v1] Tue, 3 Oct 2023 07:23:22 UTC (890 KB)

Computer Science > Computation and Language

Title:Benchmarking and Improving Generator-Validator Consistency of Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Benchmarking and Improving Generator-Validator Consistency of Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators