sungyub/rstar-coder-verl
收藏Hugging Face2025-11-07 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/sungyub/rstar-coder-verl
下载链接
链接失效反馈官方服务:
资源简介:
rStar-Coder VERL数据集包含386,640个编码问题,这些问题是从microsoft/rStar-Coder集合转换而来,专为大型语言模型的强化学习训练设计。数据集包括用于代码执行验证的基于测试用例的地面真实数据。每个问题都按照VERL格式组织,包含数据源、提示(角色和内容)、能力、奖励模型(评估方法和地面真实数据)、以及额外信息(记录标识和原始问题标识)。
The rStar-Coder VERL Dataset contains 386,640 coding problems transformed from the microsoft/rStar-Coder collection, designed for reinforcement learning training of large language models. The dataset includes test-case-based ground truth for code execution verification. Each problem is organized in the VERL format, including data source, prompt (role and content), ability, reward model (evaluation approach and ground truth), and extra info (record identifier and original question identifier).
提供机构:
sungyub



