Open-Reasoner-Zero/orz_math_72k_collection_extended
收藏Hugging Face2025-04-06 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/Open-Reasoner-Zero/orz_math_72k_collection_extended
下载链接
链接失效反馈官方服务:
资源简介:
Open-Reasoner-Zero是一个关注可扩展性、简洁性和可访问性的大规模推理导向的强化学习训练的开源实现。它包含了从AIME、MATH、Numina-Math collection和Tulu3 MATH等来源收集和扩展的57k原始数据和72k扩展数据,以及从中挖掘的13k难数据。
Open-Reasoner-Zero is an open-source implementation of large-scale reasoning-oriented RL training focusing on scalability, simplicity, and accessibility. It includes 57k original data and 72k extended data collected and expanded from sources such as AIME, MATH, Numina-Math collection, and Tulu3 MATH, as well as 13k hard data mined from them.
提供机构:
Open-Reasoner-Zero



