TEEN-D/grpo-oumi-c2d-d2c-subset
收藏Hugging Face2025-04-24 更新2025-11-01 收录
下载链接:
https://hf-mirror.com/datasets/TEEN-D/grpo-oumi-c2d-d2c-subset
下载链接
链接失效反馈官方服务:
资源简介:
GRPO Oumi ANLI子集数据集是对oumi-ai/oumi-c2d-d2c-subset数据集的重新格式化版本,专为GRPO训练器设计。数据集由一系列字典组成,每个字典代表一个数据实例,包含prompt和completion字段。prompt字段包括上下文文档和用户请求,completion字段包括模型预期的响应,如主张、子主张、引用标记、解释和支持状态。
This dataset is a reformatted version of the `oumi-ai/oumi-c2d-d2c-subset` dataset, specifically structured for use with the GRPO trainer. The dataset consists of a list of dictionaries, each representing a single data instance with a `prompt` and a `completion` field.
提供机构:
TEEN-D



