Non-Programmers Can Label Programs Indirectly via Active Examples: A Case Study with Text-to-SQL

Zhong, Ruiqi; Snell, Charlie; Klein, Dan; Eisner, Jason

Computer Science > Computation and Language

arXiv:2205.12422 (cs)

[Submitted on 25 May 2022 (v1), last revised 23 Oct 2023 (this version, v3)]

Title:Non-Programmers Can Label Programs Indirectly via Active Examples: A Case Study with Text-to-SQL

Authors:Ruiqi Zhong, Charlie Snell, Dan Klein, Jason Eisner

View PDF

Abstract:Can non-programmers annotate natural language utterances with complex programs that represent their meaning? We introduce APEL, a framework in which non-programmers select among candidate programs generated by a seed semantic parser (e.g., Codex). Since they cannot understand the candidate programs, we ask them to select indirectly by examining the programs' input-ouput examples. For each utterance, APEL actively searches for a simple input on which the candidate programs tend to produce different outputs. It then asks the non-programmers only to choose the appropriate output, thus allowing us to infer which program is correct and could be used to fine-tune the parser. As a first case study, we recruited human non-programmers to use APEL to re-annotate SPIDER, a text-to-SQL dataset. Our approach achieved the same annotation accuracy as the original expert annotators (75%) and exposed many subtle errors in the original annotations.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Programming Languages (cs.PL)
Cite as:	arXiv:2205.12422 [cs.CL]
	(or arXiv:2205.12422v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2205.12422

Submission history

From: Ruiqi Zhong [view email]
[v1] Wed, 25 May 2022 00:35:12 UTC (2,779 KB)
[v2] Sat, 14 Oct 2023 03:19:16 UTC (10,828 KB)
[v3] Mon, 23 Oct 2023 11:12:48 UTC (10,776 KB)

Computer Science > Computation and Language

Title:Non-Programmers Can Label Programs Indirectly via Active Examples: A Case Study with Text-to-SQL

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Non-Programmers Can Label Programs Indirectly via Active Examples: A Case Study with Text-to-SQL

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators