Babelscape/LLM-Oasis_e2e_factuality_evaluation

Name: Babelscape/LLM-Oasis_e2e_factuality_evaluation
Creator: Babelscape
Published: 2025-10-15 11:37:23
License: 暂无描述

Hugging Face2025-10-15 更新2024-12-14 收录

下载链接：

https://hf-mirror.com/datasets/Babelscape/LLM-Oasis_e2e_factuality_evaluation

下载链接

链接失效反馈

官方服务：

资源简介：

LLM-Oasis_e2e_factuality_evaluation数据集是LLM-Oasis套件的一部分，包含用于评估原始文本事实准确性的黄金标准数据集。文本是维基百科文本的改写或伪造版本。该数据集支持端到端的事实性评估任务。数据集包含两个特征：text用于事实性评估的原始文本，和id每个示例的唯一标识符。数据集的黄金分割包含1,708个示例。

LLM-Oasis_e2e_factuality_evaluation is part of the LLM-Oasis suite and contains the gold-standard dataset for evaluating the factual accuracy of raw texts. Texts are either paraphrases or falsified versions of a text from Wikipedia. This dataset supports the end-to-end factuality evaluation task described in Section 4.2 of the LLM-Oasis paper. The dataset includes two features: text for the raw text to be evaluated for factuality, and id as a unique identifier for each example. The gold split of the dataset contains 1,708 examples.

提供机构：

Babelscape

5,000+

优质数据集

54 个

任务类型

进入经典数据集