FEVEROUS: Fact Extraction and VERification Over Unstructured and Structured information

Aly, Rami; Guo, Zhijiang; Schlichtkrull, Michael; Thorne, James; Vlachos, Andreas; Christodoulopoulos, Christos; Cocarascu, Oana; Mittal, Arpit

Computer Science > Computation and Language

arXiv:2106.05707 (cs)

[Submitted on 10 Jun 2021 (v1), last revised 12 Oct 2021 (this version, v3)]

Title:FEVEROUS: Fact Extraction and VERification Over Unstructured and Structured information

Authors:Rami Aly, Zhijiang Guo, Michael Schlichtkrull, James Thorne, Andreas Vlachos, Christos Christodoulopoulos, Oana Cocarascu, Arpit Mittal

View PDF

Abstract:Fact verification has attracted a lot of attention in the machine learning and natural language processing communities, as it is one of the key methods for detecting misinformation. Existing large-scale benchmarks for this task have focused mostly on textual sources, i.e. unstructured information, and thus ignored the wealth of information available in structured formats, such as tables. In this paper we introduce a novel dataset and benchmark, Fact Extraction and VERification Over Unstructured and Structured information (FEVEROUS), which consists of 87,026 verified claims. Each claim is annotated with evidence in the form of sentences and/or cells from tables in Wikipedia, as well as a label indicating whether this evidence supports, refutes, or does not provide enough information to reach a verdict. Furthermore, we detail our efforts to track and minimize the biases present in the dataset and could be exploited by models, e.g. being able to predict the label without using evidence. Finally, we develop a baseline for verifying claims against text and tables which predicts both the correct evidence and verdict for 18% of the claims.

Comments:	Accepted at NeurIPS 2021 Datasets and Benchmarks Track
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2106.05707 [cs.CL]
	(or arXiv:2106.05707v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2106.05707

Submission history

From: Rami Aly [view email]
[v1] Thu, 10 Jun 2021 12:47:36 UTC (7,312 KB)
[v2] Thu, 16 Sep 2021 11:32:55 UTC (10,043 KB)
[v3] Tue, 12 Oct 2021 09:41:34 UTC (10,043 KB)

Computer Science > Computation and Language

Title:FEVEROUS: Fact Extraction and VERification Over Unstructured and Structured information

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:FEVEROUS: Fact Extraction and VERification Over Unstructured and Structured information

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators