Kwai-Klear/KlearReasoner-CodeSub-15K
Source: Hugging Face | Updated: 2025-09-27 | Indexed: 2025-09-13
Download link:
https://hf-mirror.com/datasets/Kwai-Klear/KlearReasoner-CodeSub-15K
Description:
This dataset is a high-quality subset of the Klear-Reasoner Code RL dataset, derived from the RL data used in the rllm project. Part of this data was used to train Klear-Reasoner's code reasoning models. The dataset has been carefully cleaned and filtered so that it contains only reliable samples suitable for reinforcement learning. Models trained on this dataset have shown substantial performance improvements across various code reasoning benchmarks.
Provider:
Kwai-Klear



