five

HalluciGen-PG

收藏
arXiv2025-09-30 收录
下载链接:
https://huggingface.co/datasets/Eloquent/HalluciGen-PG
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集名为HalluciGen-PG,包含了用于生成释义的示例,其中模型会接收到英文和瑞典语两种可能的给定源句子的释义。每个示例都包括一个源句子、一个正确的假设以及一个包含内在幻觉的错误假设。此外,该数据集还包含了被归类为十一种不同类型错误或添加的幻觉假设,这些错误或添加破坏了蕴涵关系。规模上,数据集包含了138个英文示例和139个瑞典示例,其任务是进行释义生成。

The dataset named HalluciGen-PG contains examples for paraphrase generation, where models are provided with paraphrases of given source sentences in two languages: English and Swedish. Each example consists of a source sentence, a correct hypothesis, and an erroneous hypothesis containing inherent hallucinations. Furthermore, the dataset includes hallucinatory hypotheses categorized into 11 distinct types of errors or additions that break the entailment relationship. In terms of scale, the dataset has 138 English examples and 139 Swedish examples, with the task of the dataset being paraphrase generation.
提供机构:
Eloquent
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作