saurabh5/RL0-Math-Data
Source: Hugging Face | Updated: 2025-10-20 | Indexed: 2025-10-25
Download link:
https://hf-mirror.com/datasets/saurabh5/RL0-Math-Data
Description:
The dataset contains the following fields: prompt, solution, data_source, role, ability, reward_model, extra_info, dataset (the dataset name), ground_truth, frontier_id, olmo_completions (OLMo completions), verifier_scores, avg_score, dataset_source, input_ids (for both prompt and source), attention_mask, and labels. It has a train split, which is 2.90 GB in size and contains 13,314 samples.
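As a rough illustration of how the field list above might be checked against real records, here is a minimal Python sketch. It assumes the Hugging Face `datasets` library is installed and that the repository ID matches the title of this page. The helper names `missing_fields` and `stream_train_split` are hypothetical. The description does not say whether each field is a top-level column or nested inside another one, so treating them as top-level keys is an assumption. `role` and the `input_ids` fields are left out because their exact column names are not stated.

```python
# Field names as listed in the description above. Treating each one as a
# top-level key of a record is an assumption, not something the page confirms.
EXPECTED_FIELDS = [
    "prompt", "solution", "data_source", "ability", "reward_model",
    "extra_info", "dataset", "ground_truth", "frontier_id",
    "olmo_completions", "verifier_scores", "avg_score", "dataset_source",
    "attention_mask", "labels",
]


def missing_fields(record, expected=EXPECTED_FIELDS):
    """Return the expected field names that are absent from one record."""
    return [name for name in expected if name not in record]


def stream_train_split(repo_id="saurabh5/RL0-Math-Data"):
    """Stream the train split rather than downloading all 2.90 GB up front."""
    # Imported lazily so missing_fields() works without the library installed.
    from datasets import load_dataset

    return load_dataset(repo_id, split="train", streaming=True)
```

With network access, `missing_fields(next(iter(stream_train_split())))` would report which of the listed names do not appear as top-level keys in the first record. A non-empty result may only mean that a field is nested, not that it is absent.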
Provided by:
saurabh5



