
yourbench-testing/test-custom-schema-simple

Hugging Face | Updated 2025-12-17 | Indexed 2025-12-20
Download link:
https://hf-mirror.com/datasets/yourbench-testing/test-custom-schema-simple
Description:
This dataset, Test Custom Schema Simple, was generated with YourBench (v0.6.0), an open-source framework for building domain-specific benchmarks from document collections. It contains three configurations, ingested, chunked, and single_shot_questions, each holding the output of one pipeline step with its own features. The ingestion step reads raw source documents and converts them to normalized markdown; the chunking step splits the text into token-based single-hop and multi-hop chunks; the single_shot_question_generation step uses an LLM to generate standalone question-answer pairs for each chunk.
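
To make the chunking step described above more concrete, here is a minimal sketch of token-based chunking. It is not YourBench's implementation: the function names `single_hop_chunks` and `multi_hop_chunks` are hypothetical, whitespace-separated words stand in for real tokenizer tokens, and a multi-hop chunk is assumed to be a combination of several single-hop chunks from the same document.

```python
from itertools import combinations


def single_hop_chunks(text: str, max_tokens: int = 8) -> list[str]:
    """Split text into consecutive chunks of at most max_tokens tokens.

    Assumption: tokens are whitespace-separated words, not model tokens.
    """
    if max_tokens < 1:
        raise ValueError("max_tokens must be at least 1")
    tokens = text.split()
    return [
        " ".join(tokens[i:i + max_tokens])
        for i in range(0, len(tokens), max_tokens)
    ]


def multi_hop_chunks(chunks: list[str], hops: int = 2) -> list[tuple[str, ...]]:
    """Group single-hop chunks into every combination of `hops` chunks.

    Assumption: a multi-hop chunk is a tuple of single-hop chunks,
    kept in their original document order.
    """
    return list(combinations(chunks, hops))


if __name__ == "__main__":
    doc = "one two three four five six seven eight nine ten"
    singles = single_hop_chunks(doc, max_tokens=4)
    print(singles)
    print(multi_hop_chunks(singles, hops=2))
```

A question generator would then receive one single-hop chunk at a time for standalone questions, while the multi-hop tuples could feed questions that need information from more than one chunk.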
Provider:
yourbench-testing