mlfoundations-dev/multiple_samples_shortest_seed_code_w_openthoughts
收藏Hugging Face2025-02-17 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/mlfoundations-dev/multiple_samples_shortest_seed_code_w_openthoughts
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了system系统信息、对话信息(包括对话来源和内容)、问题描述、推理过程、解决方案、唯一标识符、真实解决方案、数据来源、代码字段(可能为空)、正确性标识、评估推理、原始行索引、基于r1_distill_70b模型的响应、多数响应和经过验证的r1_distill_70b模型响应等字段。数据集被划分为训练集,包含约120668个样本,总大小约为3.24GB。
The dataset includes fields such as system information, conversation information (including the source and content of the conversation), problem description, reasoning process, solution, unique identifier, ground truth solution, data source, code (which may be null), correctness flag, evaluation of reasoning, original row index, response based on r1_distill_70b model, majority responses, and verified r1_distill_70b model response. The dataset is split into a training set with approximately 120,668 samples, totaling about 3.24GB in size.
提供机构:
mlfoundations-dev



