llm-editing/HalluEditBench

Name: llm-editing/HalluEditBench
Creator: llm-editing
Published: 2025-06-09 16:24:07
License: 暂无描述

Hugging Face2025-06-09 更新2025-04-26 收录

下载链接：

https://hf-mirror.com/datasets/llm-editing/HalluEditBench

下载链接

链接失效反馈

官方服务：

资源简介：

HalluEditBench是一个全面评估知识编辑方法在纠正大型语言模型（LLMs）中虚假信息性能的数据集。它包含了9个领域、26个主题和超过6000个虚假信息实例，用于评估知识编辑方法在五个维度上的性能，包括有效性、泛化能力、迁移性、局部性和鲁棒性。

HalluEditBench is a dataset designed to holistically benchmark the performance of knowledge editing methods in correcting hallucinations in Large Language Models (LLMs). It contains over 6,000 hallucination instances across 9 domains and 26 topics, used to assess the performance of knowledge editing methods on five dimensions: Efficacy, Generalization, Portability, Locality, and Robustness.

提供机构：

llm-editing

5,000+

优质数据集

54 个

任务类型

进入经典数据集