yikeee/Open-Reward-Agent-rubric-sft-mix-openrubric-only
收藏Hugging Face2026-04-29 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/yikeee/Open-Reward-Agent-rubric-sft-mix-openrubric-only
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为Open-Reward-Agent-rubric-sft-mix-openrubric-only,可能用于AI代理的奖励模型训练或监督微调,基于规则(rubric)进行混合。数据集包含消息对话(包括内容和角色)、数据集来源和标签等特征,分为训练集(10689个示例)和评估集(1203个示例),用于自然语言处理任务。
The dataset named Open-Reward-Agent-rubric-sft-mix-openrubric-only is likely used for training or evaluating AI agents in reward modeling or supervised fine-tuning, based on a rubric mix. It features message conversations (including content and roles), dataset sources, and tags, with splits for training (10,689 examples) and evaluation (1,203 examples), designed for natural language processing tasks.
提供机构:
yikeee



