arcagi2/rl-only-official
收藏Hugging Face2025-10-10 更新2025-11-01 收录
下载链接:
https://hf-mirror.com/datasets/arcagi2/rl-only-official
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个包含文本和结构化信息的复杂数据集,主要用于训练和评估。它包括数据源、提示(包括内容和角色)、能力、额外信息(如谜题ID、来源和分割)、代理名称和奖励模型(包括输入和输出)。数据集分为训练集和评估集,支持默认配置。
This dataset is a complex dataset containing text and structured information, primarily used for training and evaluation. It includes data source, prompt (including content and role), ability, extra information (such as puzzle ID, source, and split), agent name, and reward model (including input and output). The dataset is divided into training and evaluation sets and supports the default configuration.
提供机构:
arcagi2



