SEMQA: Semi-Extractive Multi-Source Question Answering

Schuster, Tal; Lelkes, Adam D.; Sun, Haitian; Gupta, Jai; Berant, Jonathan; Cohen, William W.; Metzler, Donald

Computer Science > Computation and Language

arXiv:2311.04886 (cs)

[Submitted on 8 Nov 2023 (v1), last revised 30 Jun 2024 (this version, v2)]

Title:SEMQA: Semi-Extractive Multi-Source Question Answering

Authors:Tal Schuster, Adam D. Lelkes, Haitian Sun, Jai Gupta, Jonathan Berant, William W. Cohen, Donald Metzler

View PDF HTML (experimental)

Abstract:Recently proposed long-form question answering (QA) systems, supported by large language models (LLMs), have shown promising capabilities. Yet, attributing and verifying their generated abstractive answers can be difficult, and automatically evaluating their accuracy remains an ongoing challenge.
In this work, we introduce a new QA task for answering multi-answer questions by summarizing multiple diverse sources in a semi-extractive fashion. Specifically, Semi-extractive Multi-source QA (SEMQA) requires models to output a comprehensive answer, while mixing factual quoted spans -- copied verbatim from given input sources -- and non-factual free-text connectors that glue these spans together into a single cohesive passage. This setting bridges the gap between the outputs of well-grounded but constrained extractive QA systems and more fluent but harder to attribute fully abstractive answers. Particularly, it enables a new mode for language models that leverages their advanced language generation capabilities, while also producing fine in-line attributions by-design that are easy to verify, interpret, and evaluate.
To study this task, we create the first dataset of this kind, QuoteSum, with human-written semi-extractive answers to natural and generated questions, and define text-based evaluation metrics. Experimenting with several LLMs in various settings, we find this task to be surprisingly challenging, demonstrating the importance of QuoteSum for developing and studying such consolidation capabilities.

Comments:	NAACL 2024
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2311.04886 [cs.CL]
	(or arXiv:2311.04886v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2311.04886

Submission history

From: Tal Schuster [view email]
[v1] Wed, 8 Nov 2023 18:46:32 UTC (9,544 KB)
[v2] Sun, 30 Jun 2024 18:53:22 UTC (9,549 KB)

Computer Science > Computation and Language

Title:SEMQA: Semi-Extractive Multi-Source Question Answering

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:SEMQA: Semi-Extractive Multi-Source Question Answering

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators