TEEN-D/grpo-oumi-synthetic-document-claims
收藏Hugging Face2025-04-24 更新2025-11-01 收录
下载链接:
https://hf-mirror.com/datasets/TEEN-D/grpo-oumi-synthetic-document-claims
下载链接
链接失效反馈官方服务:
资源简介:
GRPO Oumi ANLI子集是一个重新格式化的数据集,基于oumi-ai/oumi-synthetic-document-claims数据集,专门为GRPO训练器构建。它由一系列包含prompt和completion字段的字典组成,用于生成文本、验证主张、指令调整等任务。
The GRPO Oumi ANLI Subset is a reformatted version of the oumi-ai/oumi-synthetic-document-claims dataset, specifically structured for the GRPO trainer. It consists of a series of dictionaries with prompt and completion fields, designed for tasks such as text generation, claim verification, and instruction tuning.
提供机构:
TEEN-D



