five

Trelis/touch-rugby-benchmark

收藏
Hugging Face2025-04-11 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/Trelis/touch-rugby-benchmark
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含了多个配置的数据,每个配置都有不同的特征和用途: 1. chunked配置:包含文档ID、文档文本、文档文件名、文档元数据(文件大小)、原始文档摘要、文档摘要、摘要模型、文本块信息、多跳文本块信息、块信息指标和块处理模型等特征。 2. ingested配置:包含文档ID、文档文本、文档文件名和文档元数据(文件大小)等特征。 3. lighteval配置:包含问题、真实答案、问题类别、问题类型、估计难度、引用、文档ID、文本块ID、问题生成模型、文本块和文档文本等特征。 4. multi_hop_questions配置:包含文档ID、源文本块ID、问题、自我答案、估计难度、自我评估问题类型、生成模型、思考过程和引用等特征。 5. single_shot_questions配置:包含文本块ID、文档ID、问题、自我答案、估计难度、自我评估问题类型、生成模型、思考过程和原始响应等特征。 6. summarized配置:包含文档ID、文档文本、文档文件名、文档元数据(文件大小)、原始文档摘要、文档摘要和摘要模型等特征。每个配置都有训练集分割,且数据集提供了不同大小的下载和实际数据集大小信息。

The dataset contains multiple configurations, each with different features and purposes: 1. chunked configuration: includes features like document ID, document text, document filename, document metadata (file size), raw document summary, document summary, summarization model, text chunk information, multi-hop text chunk information, chunk information metrics, and chunking model. 2. ingested configuration: includes features like document ID, document text, document filename, and document metadata (file size). 3. lighteval configuration: includes features like question, ground truth answer, question category, question type, estimated difficulty, citations, document ID, chunk IDs, question generating model, chunks, and document text. 4. multi_hop_questions configuration: includes features like document ID, source chunk IDs, question, self answer, estimated difficulty, self-assessed question type, generating model, thought process, and citations. 5. single_shot_questions configuration: includes features like chunk ID, document ID, question, self answer, estimated difficulty, self-assessed question type, generating model, thought process, and raw response. 6. summarized configuration: includes features like document ID, document text, document filename, document metadata (file size), raw document summary, document summary, and summarization model. Each configuration has a training set split, and the dataset provides different download sizes and actual dataset size information.
提供机构:
Trelis
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作