big-reasoning-traces
Source: ModelScope community, updated 2025-10-09, indexed 2025-05-31
Download link:
https://modelscope.cn/datasets/allenai/big-reasoning-traces
Description:
A compiled dataset of large permissively licensed reasoning traces for experiments with midtraining / annealing before RL.
The total is ~2.5B tokens (2513520233), counted with the OLMo 2 tokenizer.
Sources:
* [GeneralThought-430K](https://huggingface.co/datasets/GeneralReasoning/GeneralThought-430K) (entries under non-commercial (NC) licenses removed), 337579 examples, 696270809 tokens
* [OpenThoughts-114k](https://huggingface.co/datasets/open-thoughts/OpenThoughts-114k), 113957 examples, 766590712 tokens
* [OpenR1-Math-220k](https://huggingface.co/datasets/open-r1/OpenR1-Math-220k) (all), 225129 examples, 1050658712 tokens
A script for reformatting is in the repo.
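
The per-source figures listed above can be cross-checked against the stated total. The following is a minimal sketch in Python, not part of the dataset's own tooling; the names `SOURCES`, `STATED_TOTAL_TOKENS`, and `summarize` are hypothetical and chosen only for illustration. The numbers are copied from the source list.

```python
# Per-source (examples, tokens) figures copied from the source list above.
# The variable and function names here are illustrative, not from the repo.
SOURCES = {
    "GeneralThought-430K": (337579, 696270809),
    "OpenThoughts-114k": (113957, 766590712),
    "OpenR1-Math-220k": (225129, 1050658712),
}
STATED_TOTAL_TOKENS = 2513520233  # total quoted in the description


def summarize(sources):
    """Return total examples, total tokens, token share and mean tokens per example."""
    total_examples = sum(n for n, _ in sources.values())
    total_tokens = sum(t for _, t in sources.values())
    shares = {name: t / total_tokens for name, (_, t) in sources.items()}
    avg_tokens = {name: t / n for name, (n, t) in sources.items()}
    return total_examples, total_tokens, shares, avg_tokens


total_examples, total_tokens, shares, avg_tokens = summarize(SOURCES)

# The per-source token counts should add up to the stated total.
assert total_tokens == STATED_TOTAL_TOKENS

print(f"examples: {total_examples}, tokens: {total_tokens}")
for name in SOURCES:
    print(f"{name}: {shares[name]:.1%} of tokens, "
          f"{avg_tokens[name]:.0f} tokens per example on average")
```

The three token counts sum exactly to the stated total, and the token shares come out at about 27.7%, 30.5%, and 41.8% respectively, so OpenR1-Math-220k contributes the largest portion of the mix.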
Provider:
maas
Created:
2025-05-29



