IIGroup/X-Coder-RL-40k
收藏Hugging Face2026-02-07 更新2026-02-07 收录
下载链接:
https://hf-mirror.com/datasets/IIGroup/X-Coder-RL-40k
下载链接
链接失效反馈官方服务:
资源简介:
X-Coder-RL-40k是一个完全合成的强化学习数据集,用于竞争性编程,包含40k个高质量的任务和已验证的测试用例。任务由o3-mini合成,测试用例由Gemini-2.5-Pro合成。数据集按难度级别组织,从最简单到最困难。该数据集旨在用于代码生成模型的RLVR训练。
X-Coder-RL-40k is a fully synthetic reinforcement learning dataset for competitive programming, containing 40k high-quality tasks with verified test cases. The tasks are synthesized by o3-mini, and the test cases are synthesized by Gemini-2.5-Pro. The dataset is organized by difficulty level, ranging from easiest to hardest. It is intended for RLVR training for code generation models.
提供机构:
IIGroup



