kaiwenw/open_r1_mar2_DeepSeek_R1_Distill_Qwen_1.5B_tokenized
Source: Hugging Face. Updated 2025-04-04. Indexed 2025-04-12.
Download link:
https://hf-mirror.com/datasets/kaiwenw/open_r1_mar2_DeepSeek_R1_Distill_Qwen_1.5B_tokenized
Description:
The dataset includes the following fields: problem, solution, answer, problem type, question type, source, unique identifier, processed answer, reward, minimum length, maximum length, pass rate@1, pass rate@16, consistency@16, input sequence IDs, and output sequence IDs. It is divided into training, validation, and test sets containing 47488, 1000, and 1000 examples respectively. The total size of the dataset is approximately 58.61 GB.
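Given the stated size, it may be more practical to stream records than to download everything up front. The following is a minimal sketch, not an official loading recipe: it assumes the Hugging Face `datasets` library is installed and that the splits are named `train`, `validation`, and `test` (the split names are an assumption, as the listing only gives their sizes). The helper names `split_fractions` and `stream_examples` are hypothetical.

```python
"""Sketch: inspect the stated split sizes and stream a few records."""

REPO_ID = "kaiwenw/open_r1_mar2_DeepSeek_R1_Distill_Qwen_1.5B_tokenized"

# Split sizes as stated in the description.
# ASSUMPTION: the split names below are guesses; check the dataset card.
SPLIT_SIZES = {"train": 47488, "validation": 1000, "test": 1000}


def split_fractions(sizes):
    """Return each split's share of the total number of examples."""
    total = sum(sizes.values())
    return {name: count / total for name, count in sizes.items()}


def stream_examples(split, limit=3):
    """Fetch the first `limit` records of a split without a full download."""
    # Imported lazily so the helpers above work without the library installed.
    from itertools import islice

    from datasets import load_dataset

    # streaming=True yields records one at a time over the network.
    dataset = load_dataset(REPO_ID, split=split, streaming=True)
    return list(islice(dataset, limit))


if __name__ == "__main__":
    print("total examples:", sum(SPLIT_SIZES.values()))
    for name, fraction in split_fractions(SPLIT_SIZES).items():
        print(f"{name}: {fraction:.2%}")
```

Calling `stream_examples("train")` would then return a short list of records whose keys can be printed to see the actual column names, which this listing describes only informally.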
Provider:
kaiwenw



