purpcode/code-r1-46k-leetcode2k-kodcode
收藏Hugging Face2025-08-10 更新2025-11-01 收录
下载链接:
https://hf-mirror.com/datasets/purpcode/code-r1-46k-leetcode2k-kodcode
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含多个字段的数据集,主要用于训练和测试NLP模型。数据集的字段包括任务ID、提示信息(包括内容和角色)、入口点、测试字段、完成字段、示例(输入和输出)、源、元信息(包括难度、语言代码、查询、问题ID、问题标题、响应和分割信息)、数据来源、能力和奖励模型(包括真实标签和风格)。数据集分为训练集、测试集和特殊训练集(sft),并提供默认配置,指定了各个分割的数据文件路径。
This is a dataset with multiple fields primarily designed for training and testing NLP models. The dataset fields include task ID, prompt (including content and role), entry point, test field, completion, examples (input and output), source, metadata (including difficulty, language code, query, question ID, question title, response, and split information), data source, ability, and reward model (including ground truth and style). The dataset is split into training set, test set, and special training set (sft), and provides default configurations specifying the data file paths for each split.
提供机构:
purpcode



