llm-semantic-router/halueval-spans

Name: llm-semantic-router/halueval-spans
Creator: llm-semantic-router
Published: 2026-01-09 23:50:44
License: 暂无描述

Hugging Face2026-01-09 更新2026-02-07 收录

下载链接：

https://hf-mirror.com/datasets/llm-semantic-router/halueval-spans

下载链接

链接失效反馈

官方服务：

资源简介：

HaluEval Span-Level Dataset (LLM-Detected) 是一个高质量的跨度级幻觉检测数据集，通过使用 Qwen2.5-72B-Instruct 从 HaluEval 转换而来，提供精确的跨度检测和 RAGTruth 兼容的标签。数据集包含 10,000 个摘要样本，其中 8,905 个样本检测到幻觉（89%），总共有 16,359 个幻觉跨度。数据集包含四种 RAGTruth 兼容的标签类型：明显无根据信息、明显冲突、微妙无根据信息和微妙冲突。该数据集专为跨度级幻觉检测任务设计，如训练令牌级幻觉检测器、多标签分类和研究细粒度幻觉类型。它与 RAGTruth 兼容，适合联合训练和评估。

The HaluEval Span-Level Dataset (LLM-Detected) is a high-quality span-level hallucination detection dataset converted from HaluEval using Qwen2.5-72B-Instruct for precise span detection and RAGTruth-compatible labeling. It contains 10,000 summarization samples with 8,905 samples containing detected hallucinations (89%) and 16,359 total hallucinated spans. The dataset features four RAGTruth-compatible label types: Evident Baseless Info, Evident Conflict, Subtle Baseless Info, and Subtle Conflict. Designed for span-level hallucination detection tasks, it is suitable for training token-level hallucination detectors, multi-label classification, and research on fine-grained hallucination types. It is compatible with RAGTruth and suitable for combined training and evaluation.

提供机构：

llm-semantic-router

5,000+

优质数据集

54 个

任务类型

进入经典数据集