GeneralThought-323K
收藏魔搭社区2025-11-23 更新2025-03-15 收录
下载链接:
https://modelscope.cn/datasets/AI-ModelScope/GeneralThought-323K
下载链接
链接失效反馈官方服务:
资源简介:
<img src="https://cdn-media.gr.inc/logoonly.png" alt="General Reasoning" width="150">
# GeneralThought-323K
**NEWEST RELEASE WITH 430K TRACES IS [HERE](https://huggingface.co/datasets/GeneralReasoning/GeneralThought-430K)**
> Thought wants to be free
Open reasoning data from the [General Reasoning](https://gr.inc) resource for March 8 2025.
The dataset contains questions, reference answers, reasoning traces, final answers and other metadata from several popular reasoning models including [DeepSeek-R1](https://gr.inc/DeepSeek/models/DeepSeek-R1/), [DeepSeek-R1-Zero](https://gr.inc/DeepSeek/models/DeepSeek-R1-Zero/), [OpenThoughts-32B](https://gr.inc/open-thoughts/models/OpenThinker-32B/), [LIMO](https://gr.inc/GAIR-NLP/models/LIMO/), [deepseek-r1-distill-llama-70b](https://gr.inc/DeepSeek/models/deepseek-r1-distill-llama-70b/), [DeepHermes-3-Llama-3-8B-Preview](https://gr.inc/NousResearch/models/DeepHermes-3-Llama-3-8B-Preview/) and [DeepScaleR-1.5B-Preview](https://gr.inc/agentica-org/models/DeepScaleR-1.5B-Preview/). We also include final answers from [o3-mini-2025-01-31](https://gr.inc/OpenAI/models/o3-mini-2025-01-31/), [gemini-2-flash-thinking-exp-01-21](https://gr.inc/Google/models/gemini-2-flash-thinking-exp-01-21/) and [claude-3-7-sonnet-20250219](https://gr.inc/Anthropic/models/claude-3-7-sonnet-20250219/) for comparison and evaluation. This release has 323k rows of data.
## Improvements
The main improvement in this update is trace diversity. There are many more reasoning traces beyond mathematics and code, including the natural sciences, humanities, social sciences, and general conversations.
## Metadata
A row of data contains the following information:
```python
row = {
'question_id': '296582', # question ID on the GR resource
'question_url': 'https://gr.inc/question/of-a-quadrilateral-if-its-angle-measures-are-in-the-ratio-of-3456-find-the-m', # URL on gr.inc
'question': 'Of a quadrilateral if its angle measures are in the ratio of 3:4:5:6, find the measure of each angle.', # Question text
'prev_messages': None, # previous messages in the conversation
'reference_answer': '60°, 80°, 100°, 120°', # Reference answer
'model_name': 'DeepSeek/DeepSeek-R1', # The model that generated the trace
'model_answer': 'The measures of the angles in the quadrilateral are calculated as follows...', # the model final answer text
'model_reasoning': 'Okay, so I need to find the measures of each angle in a quadrilateral...' # the model reasoning text
'task': 'Applying Ratios to Angle-Measure Sums', # name of the task on GR
'question_license': 'MIT', # license of the question
'question_source': 'General/VNet', # original dataset source or author on GR
'community_question_score': 0 # community score for the question on GR; negative means downvoted, positive upvoted,
'community_answer_score': 0, # community score for the answer on GR; negative means downvoted, positive upvoted
'verifier_score': 1.0 # an average verification score between 0-1; if multiple verifiers, this could be between, e.g. 0.5 if one verifier marks as correct, another incorrect
}
```
## How can I use the data?
The dataset is a great complement to [OpenThoughts-114k](https://huggingface.co/datasets/open-thoughts/OpenThoughts-114k), [OpenR1](https://huggingface.co/datasets/open-r1/OpenR1-Math-Raw), [SYNTHETIC-1](https://huggingface.co/datasets/PrimeIntellect/SYNTHETIC-1), among others!
Here's some example use cases for our dataset:
- Perform SFT distillation and use it to train a small reasoning model.
- Ablate alongside datasets from our open source friends (and see if cross-group diversity helps).
- Analyse reasoning differences between models: reasoning length, language switching, and use of connectors like "wait" and "alternatively".
The verification side of the GR resource is still early, so we would recommend focusing on distillation (and rejection sampling) rather than online RL for now.
## Thanks
Thank you to the contributors of questions for this dataset:
First - thanks to the questions we sourced from folks like [Numina](https://huggingface.co/datasets/AI-MO/NuminaMath-CoT), [SCP-116k](https://huggingface.co/datasets/EricLu/SCP-116K), [natural_reasoning](https://huggingface.co/datasets/facebook/natural_reasoning) and others! We've credited you in the question_source field of each row of the dataset.
Thanks to GR community contributors who contributed:
- Jarius, otaldohenrikk, knight_raider, supahao, alpam, Esac, gonros, tomsercu, ryan, sidoneytemporary977, panpan, Tim_tom_0, arpitg1991, Doge, tginart, pcpthm, eli5, yych, caijie, yuchen.zhang2003, lockon, susheelt, wangxinjing, duyiyang, Slimane, FABGYUXIN, chendarcy, Sin, robintan, imhillxtz, navinahc, z, zhangdapao, yixiangRDS500
Going forward we will continue to credit those who contribute questions in future data dumps on Hugging Face 🤗.
**We will look to publish a paper with co-authorship for contributors.**
<img src="https://cdn-media.gr.inc/logoonly.png" alt="General Reasoning" width="150">
# GeneralThought-323K
**最新发布的含43万条推理轨迹的版本已上线:[点击前往](https://huggingface.co/datasets/GeneralReasoning/GeneralThought-430K)**
> 思想本应自由
本数据集为2025年3月8日发布的General Reasoning(通用推理,GR)资源下的开放推理数据。
本数据集涵盖了多款主流推理模型生成的问题、参考答案、推理轨迹、最终答案及其他元数据,涉及的模型包括DeepSeek-R1、DeepSeek-R1-Zero、OpenThoughts-32B、LIMO、deepseek-r1-distill-llama-70b、DeepHermes-3-Llama-3-8B-Preview以及DeepScaleR-1.5B-Preview。此外,为便于对比与评估,本数据集还收录了o3-mini-2025-01-31、gemini-2-flash-thinking-exp-01-21及claude-3-7-sonnet-20250219的最终答案。本次发布的数据共计32.3万条。
## 本次更新改进
本次更新的核心改进在于推理轨迹的多样性。本次新增了大量覆盖自然科学、人文社科、日常对话等非数学与代码领域的推理轨迹。
## 元数据字段说明
单条数据包含以下字段:
python
row = {
'question_id': '296582', # GR资源上的问题唯一标识符
'question_url': 'https://gr.inc/question/of-a-quadrilateral-if-its-angle-measures-are-in-the-ratio-of-3456-find-the-m', # gr.inc平台上的问题链接
'question': 'Of a quadrilateral if its angle measures are in the ratio of 3:4:5:6, find the measure of each angle.', # 问题原文
'prev_messages': None, # 对话历史消息
'reference_answer': '60°, 80°, 100°, 120°', # 参考答案
'model_name': 'DeepSeek/DeepSeek-R1', # 生成该推理轨迹的模型名称
'model_answer': 'The measures of the angles in the quadrilateral are calculated as follows...', # 模型最终回答文本
'model_reasoning': 'Okay, so I need to find the measures of each angle in a quadrilateral...' # 模型推理过程文本
'task': 'Applying Ratios to Angle-Measure Sums', # GR资源上的任务名称
'question_license': 'MIT', # 问题授权协议
'question_source': 'General/VNet', # GR资源上的原始数据集来源或作者
'community_question_score': 0 # GR资源上的社区问题评分,负值代表差评,正值代表好评
'community_answer_score': 0, # GR资源上的社区回答评分,负值代表差评,正值代表好评
'verifier_score': 1.0 # 0-1区间的平均验证评分;若存在多名验证者,例如一名验证者标记正确、一名标记错误,则评分可为0.5
}
## 数据集使用场景
本数据集可作为OpenThoughts-114k、OpenR1、SYNTHETIC-1等多款公开数据集的优质补充。
以下为本数据集的典型应用场景:
- 执行监督微调(SFT)蒸馏,用于训练小型推理模型。
- 与开源伙伴的数据集开展消融对照实验,探究跨群体多样性是否能提升模型性能。
- 分析不同模型间的推理差异:包括推理长度、语言切换,以及“等一下”“或者说”这类连接词的使用习惯。
目前GR资源的验证模块仍处于早期阶段,因此我们建议现阶段优先关注蒸馏(及拒绝采样)技术,而非在线强化学习(RL)。
## 致谢
谨为本数据集的问题贡献者致谢:
首先,感谢我们从Numina、SCP-116k、natural_reasoning等来源获取的问题!我们已在数据集每条数据的`question_source`字段中为您署名。
同时感谢GR社区的以下贡献者:
Jarius, otaldohenrikk, knight_raider, supahao, alpam, Esac, gonros, tomsercu, ryan, sidoneytemporary977, panpan, Tim_tom_0, arpitg1991, Doge, tginart, pcpthm, eli5, yych, caijie, yuchen.zhang2003, lockon, susheelt, wangxinjing, duyiyang, Slimane, FABGYUXIN, chendarcy, Sin, robintan, imhillxtz, navinahc, z, zhangdapao, yixiangRDS500
未来我们将继续在Hugging Face 🤗发布的所有数据集版本中,为所有问题贡献者署名。
**我们计划为所有贡献者安排共同作者署名,并发表相关学术论文。**
提供机构:
maas
创建时间:
2025-03-15



