2015. A New Dataset and Evaluation for Belief/Factuality. In Proceedings of the Fourth Joint Conference on Lexical and Computational Semantics, pages 82–91, ...
The terms "belief" and "factuality" both refer to the intention of the writer to present the propositional content of an utterance as firmly believed by the ...
A New Dataset and Evaluation for Belief/Factuality · Vinodkumar Prabhakaran, Tomas By, +15 authors. Janyce Wiebe · Published in International Workshop on… 1 June ...
This paper presents an ongoing annotation effort and an associated evaluation. 1 Introduction This paper presents an ongoing project aimed at developing a ...
A New Dataset and Evaluation for Belief/Factuality. SEMEVAL 2015 · Vinodkumar Prabhakaran, Tomas By, Julia Hirschberg, Owen Rambow, Samira Shaikh, ...
BaRDa: A Belief and Reasoning Dataset that Separates Factual Accuracy and Reasoning Ability · Peter Clark, Bhavana Dalvi Mishra, Oyvind Tafjord
Jun 28, 2024 · We introduce Belief-R1, a new dataset designed to test LMs' belief revision ability when presented with new evidence. Inspired by how humans ...
Oct 1, 2024 · The FRAMES dataset introduced 824 questions to evaluate factuality, retrieval, and reasoning capabilities. Approximately 36% of the dataset ...
Dec 12, 2023 · The resulting dataset, called BaRDa, contains 3000 entailments (1787 valid, 1213 invalid), using 6681 true and 2319 false statements.
This repository contains the data for the FRANK Benchmark for factuality evaluation metrics (see our NAACL 2021 paper for more information).