llm-semantic-router/halueval-spans
Source: Hugging Face. Updated: 2026-01-09. Indexed: 2026-02-07.
Download link:
https://hf-mirror.com/datasets/llm-semantic-router/halueval-spans
Resource overview:
The HaluEval Span-Level Dataset (LLM-Detected) is a span-level hallucination detection dataset derived from HaluEval. Qwen2.5-72B-Instruct was used to locate hallucinated spans and to assign RAGTruth-compatible labels. The dataset contains 10,000 summarization samples, of which 8,905 (89%) contain at least one detected hallucination, for a total of 16,359 hallucinated spans. Spans are labeled with four RAGTruth-compatible types: Evident Baseless Info, Evident Conflict, Subtle Baseless Info, and Subtle Conflict. The dataset is intended for span-level hallucination detection work, such as training token-level hallucination detectors, multi-label classification, and research on fine-grained hallucination types. Because the labels follow the RAGTruth scheme, it can be combined with RAGTruth for joint training and evaluation.
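To illustrate how span-level annotations might feed a token-level detector, here is a minimal Python sketch that projects character-offset spans onto whitespace tokens. The record layout (`start`, `end`, `label` keys) and the function names are assumptions made for illustration and are not taken from the dataset card. Only the four label names come from the description above. A real pipeline would more likely use a subword tokenizer's offset mapping.

```python
import re
from typing import Dict, List, Tuple

# Label names as listed in the dataset description.
LABELS = [
    "Evident Baseless Info",
    "Evident Conflict",
    "Subtle Baseless Info",
    "Subtle Conflict",
]


def tokenize_with_offsets(text: str) -> List[Tuple[str, int, int]]:
    """Split on whitespace and keep each token's character offsets."""
    return [(m.group(), m.start(), m.end()) for m in re.finditer(r"\S+", text)]


def spans_to_token_labels(text: str, spans: List[Dict]) -> List[Tuple[str, str]]:
    """Give each token the label of the first span it overlaps, else "O".

    `spans` is a hypothetical structure: dicts with character offsets
    `start` (inclusive), `end` (exclusive) and a `label` string.
    """
    labeled = []
    for token, tok_start, tok_end in tokenize_with_offsets(text):
        label = "O"
        for span in spans:
            # Two half-open intervals overlap when each starts before the other ends.
            if tok_start < span["end"] and tok_end > span["start"]:
                label = span["label"]
                break
        labeled.append((token, label))
    return labeled


# Made-up example, not a record from the dataset.
summary = "The company was founded in 1999 in Paris."
year_start = summary.index("1999")
example_spans = [
    {"start": year_start, "end": year_start + 4, "label": "Evident Conflict"}
]

for token, label in spans_to_token_labels(summary, example_spans):
    print(f"{token}\t{label}")
```

Taking the first overlapping span keeps the sketch single-label per token. If spans can overlap, collecting every matching label per token would give a multi-label target instead.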
Provided by:
llm-semantic-router



