opencompass/NeedleBench
Source: Hugging Face. Updated: 2025-09-01. Indexed: 2024-07-22.
Download link:
https://hf-mirror.com/datasets/opencompass/NeedleBench
Resource description:
The NeedleBench dataset is part of the OpenCompass project and is designed to evaluate how well large language models (LLMs) process and understand long documents. It includes a series of test scenarios that assess models' abilities in long-text information extraction and reasoning. The dataset supports single-needle retrieval, multi-needle retrieval, multi-needle reasoning, and ancestral trace challenge tasks, and it is available in multiple languages, including Chinese and English.
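To make the single-needle retrieval task concrete, the following is a minimal, self-contained sketch of how such a test case could be assembled and scored. It is an illustration only, not the OpenCompass implementation: the function names `build_haystack` and `score_retrieval`, the filler text, the needle sentence, and the keyword-match scoring rule are all assumptions made for this example.

```python
def build_haystack(filler_sentences, needle, depth):
    """Insert `needle` into the filler text at a relative depth in [0, 1].

    depth=0.0 places the needle at the start, depth=1.0 at the end.
    (Hypothetical helper, not part of NeedleBench.)
    """
    if not 0.0 <= depth <= 1.0:
        raise ValueError("depth must be between 0 and 1")
    position = round(depth * len(filler_sentences))
    sentences = filler_sentences[:position] + [needle] + filler_sentences[position:]
    return " ".join(sentences)


def score_retrieval(model_answer, expected_keyword):
    """Return 1.0 if the expected keyword appears in the answer, else 0.0.

    A simple keyword match, assumed here for illustration.
    """
    return 1.0 if expected_keyword.lower() in model_answer.lower() else 0.0


# Made-up filler and needle; a real test would use long source documents.
filler = [f"Filler sentence number {i}." for i in range(10)]
needle = "The secret passcode is 7421."

document = build_haystack(filler, needle, depth=0.5)
prompt = document + "\n\nQuestion: What is the secret passcode?"
```

The prompt would then be sent to the model under test, and its reply passed to `score_retrieval` together with the expected keyword (here `"7421"`). Varying `depth` and the amount of filler probes how retrieval changes with needle position and context length.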
Provider:
opencompass
Original information summary
Dataset overview
License
- License type: MIT
Collected summary
Dataset introduction

The content above was collected and summarized by 遇见数据集.



