five

GeneralThought-430K

收藏
魔搭社区2025-12-05 更新2025-03-22 收录
下载链接:
https://modelscope.cn/datasets/GeneralReasoning/GeneralThought-430K
下载链接
链接失效反馈
官方服务:
资源简介:
<img src="https://cdn-media.gr.inc/logoonly.png" alt="General Reasoning" width="150"> # GeneralThought-430K > Thought wants to be free Open reasoning data from the [General Reasoning](https://gr.inc) resource for March 14 2025. The dataset contains questions, reference answers, reasoning traces, final answers and other metadata from several popular reasoning models including [DeepSeek-R1](https://gr.inc/DeepSeek/models/DeepSeek-R1/), [DeepSeek-R1-Zero](https://gr.inc/DeepSeek/models/DeepSeek-R1-Zero/), [OpenThoughts-32B](https://gr.inc/open-thoughts/models/OpenThinker-32B/), [LIMO](https://gr.inc/GAIR-NLP/models/LIMO/), [deepseek-r1-distill-llama-70b](https://gr.inc/DeepSeek/models/deepseek-r1-distill-llama-70b/), [DeepHermes-3-Llama-3-8B-Preview](https://gr.inc/NousResearch/models/DeepHermes-3-Llama-3-8B-Preview/) and [DeepScaleR-1.5B-Preview](https://gr.inc/agentica-org/models/DeepScaleR-1.5B-Preview/). We also include final answers from [o3-mini-2025-01-31](https://gr.inc/OpenAI/models/o3-mini-2025-01-31/), [gemini-2-flash-thinking-exp-01-21](https://gr.inc/Google/models/gemini-2-flash-thinking-exp-01-21/) and [claude-3-7-sonnet-20250219](https://gr.inc/Anthropic/models/claude-3-7-sonnet-20250219/) for comparison and evaluation. This release has 430k rows of data. ## Improvements The main improvement in this update is trace diversity. There are many more reasoning traces beyond mathematics and code, including the natural sciences, humanities, social sciences, and general conversations. ## Metadata A row of data contains the following information: ```python row = { 'question_id': '296582', # question ID on the GR resource 'question_url': 'https://gr.inc/question/of-a-quadrilateral-if-its-angle-measures-are-in-the-ratio-of-3456-find-the-m', # URL on gr.inc 'question': 'Of a quadrilateral if its angle measures are in the ratio of 3:4:5:6, find the measure of each angle.', # Question text 'prev_messages': None, # previous messages in the conversation 'reference_answer': '60°, 80°, 100°, 120°', # Reference answer 'model_name': 'DeepSeek/DeepSeek-R1', # The model that generated the trace 'model_answer': 'The measures of the angles in the quadrilateral are calculated as follows...', # the model final answer text 'model_reasoning': 'Okay, so I need to find the measures of each angle in a quadrilateral...' # the model reasoning text 'task': 'Applying Ratios to Angle-Measure Sums', # name of the task on GR 'question_license': 'MIT', # license of the question 'question_source': 'General/VNet', # original dataset source or author on GR 'community_question_score': 0 # community score for the question on GR; negative means downvoted, positive upvoted, 'community_answer_score': 0, # community score for the answer on GR; negative means downvoted, positive upvoted 'verifier_score': 1.0 # an average verification score between 0-1; if multiple verifiers, this could be between, e.g. 0.5 if one verifier marks as correct, another incorrect } ``` ## How can I use the data? The dataset is a great complement to [OpenThoughts-114k](https://huggingface.co/datasets/open-thoughts/OpenThoughts-114k), [OpenR1](https://huggingface.co/datasets/open-r1/OpenR1-Math-Raw), [SYNTHETIC-1](https://huggingface.co/datasets/PrimeIntellect/SYNTHETIC-1), among others! Here's some example use cases for our dataset: - Perform SFT distillation and use it to train a small reasoning model. - Ablate alongside datasets from our open source friends (and see if cross-group diversity helps). - Analyse reasoning differences between models: reasoning length, language switching, and use of connectors like "wait" and "alternatively". The verification side of the GR resource is still early, so we would recommend focusing on distillation (and rejection sampling) rather than online RL for now. ## Thanks Thank you to the contributors of questions for this dataset: First - thanks to the questions we sourced from folks like [Numina](https://huggingface.co/datasets/AI-MO/NuminaMath-CoT), [SCP-116k](https://huggingface.co/datasets/EricLu/SCP-116K), [natural_reasoning](https://huggingface.co/datasets/facebook/natural_reasoning) and others! We've credited you in the question_source field of each row of the dataset. Thanks to GR community contributors who contributed: - Jarius, otaldohenrikk, knight_raider, supahao, alpam, Esac, gonros, tomsercu, ryan, sidoneytemporary977, panpan, Tim_tom_0, arpitg1991, Doge, tginart, pcpthm, eli5, yych, caijie, yuchen.zhang2003, lockon, susheelt, wangxinjing, duyiyang, Slimane, FABGYUXIN, chendarcy, Sin, robintan, imhillxtz, navinahc, z, zhangdapao, yixiangRDS500 Going forward we will continue to credit those who contribute questions in future data dumps on Hugging Face 🤗. **We will look to publish a paper with co-authorship for contributors.**

![General Reasoning(通用推理)](https://cdn-media.gr.inc/logoonly.png) # GeneralThought-430K > 思想本应自由 本数据集为2025年3月14日发布的[General Reasoning(通用推理平台)](https://gr.inc)开源推理数据资源。 数据集包含来自多款主流推理模型的问题、参考答案、推理轨迹、最终答案及其他元数据,涉及的模型包括[DeepSeek-R1](https://gr.inc/DeepSeek/models/DeepSeek-R1/)、[DeepSeek-R1-Zero](https://gr.inc/DeepSeek/models/DeepSeek-R1-Zero/)、[OpenThoughts-32B](https://gr.inc/open-thoughts/models/OpenThinker-32B/)、[LIMO](https://gr.inc/GAIR-NLP/models/LIMO/)、[deepseek-r1-distill-llama-70b](https://gr.inc/DeepSeek/models/deepseek-r1-distill-llama-70b/)、[DeepHermes-3-Llama-3-8B-Preview](https://gr.inc/NousResearch/models/DeepHermes-3-Llama-3-8B-Preview/)以及[DeepScaleR-1.5B-Preview](https://gr.inc/agentica-org/models/DeepScaleR-1.5B-Preview/)。此外,本数据集还收录了[o3-mini-2025-01-31](https://gr.inc/OpenAI/models/o3-mini-2025-01-31/)、[gemini-2-flash-thinking-exp-01-21](https://gr.inc/Google/models/gemini-2-flash-thinking-exp-01-21/)与[claude-3-7-sonnet-20250219](https://gr.inc/Anthropic/models/claude-3-7-sonnet-20250219/)的最终答案,用于对比与评估。本次发布的数据共包含43万条记录。 ## 优化亮点 本次更新的核心优化在于推理轨迹的多样性。除数学与代码领域外,本数据集新增了大量覆盖自然科学、人文社科以及日常对话的推理轨迹样本。 ## 元数据字段说明 单条数据记录包含以下信息: python row = { 'question_id': '296582', # 该问题在General Reasoning平台上的唯一标识符 'question_url': 'https://gr.inc/question/of-a-quadrilateral-if-its-angle-measures-are-in-the-ratio-of-3456-find-the-m', # gr.inc平台上的问题链接 'question': 'Of a quadrilateral if its angle measures are in the ratio of 3:4:5:6, find the measure of each angle.', # 问题文本 'prev_messages': None, # 对话历史消息 'reference_answer': '60°, 80°, 100°, 120°', # 参考答案 'model_name': 'DeepSeek/DeepSeek-R1', # 生成该推理轨迹的模型名称 'model_answer': 'The measures of the angles in the quadrilateral are calculated as follows...', # 模型最终答案文本 'model_reasoning': 'Okay, so I need to find the measures of each angle in a quadrilateral...' # 模型推理过程文本 'task': 'Applying Ratios to Angle-Measure Sums', # 该任务在General Reasoning平台上的名称 'question_license': 'MIT', # 问题授权协议 'question_source': 'General/VNet', # 该问题的原始数据集来源或平台作者信息 'community_question_score': 0 # 该问题在General Reasoning平台的社区评分;负值代表差评,正值代表好评 'community_answer_score': 0, # 该答案在General Reasoning平台的社区评分;负值代表差评,正值代表好评 'verifier_score': 1.0 # 平均验证评分,取值范围0至1;若存在多名验证者,评分可能为区间值,例如0.5代表一名验证者判定正确、另一名判定错误 } ## 数据集使用场景 本数据集可作为[OpenThoughts-114k](https://huggingface.co/datasets/open-thoughts/OpenThoughts-114k)、[OpenR1](https://huggingface.co/datasets/open-r1/OpenR1-Math-Raw)、[SYNTHETIC-1](https://huggingface.co/datasets/PrimeIntellect/SYNTHETIC-1)等开源数据集的优质补充。 以下为本数据集的典型应用场景: - 开展监督微调(SFT)蒸馏,用于训练小型推理模型。 - 与其他开源数据集开展消融实验,探究跨群体多样性对模型性能的提升效果。 - 分析不同模型间的推理差异:如推理长度、语言切换以及“等等”“或者”等连接词的使用习惯。 目前General Reasoning平台的验证功能仍处于早期阶段,因此我们建议现阶段优先关注监督微调蒸馏(及拒绝采样)相关任务,暂不建议直接用于在线强化学习(RL)。 ## 致谢 感谢为本数据集贡献问题的所有参与者: 首先,感谢我们从[Numina](https://huggingface.co/datasets/AI-MO/NuminaMath-CoT)、[SCP-116k](https://huggingface.co/datasets/EricLu/SCP-116K)、[natural_reasoning](https://huggingface.co/datasets/facebook/natural_reasoning)等资源中获取的问题样本,我们已在每条数据记录的`question_source`字段中为相关贡献者标注了署名。 感谢General Reasoning社区的以下贡献者: Jarius, otaldohenrikk, knight_raider, supahao, alpam, Esac, gonros, tomsercu, ryan, sidoneytemporary977, panpan, Tim_tom_0, arpitg1991, Doge, tginart, pcpthm, eli5, yych, caijie, yuchen.zhang2003, lockon, susheelt, wangxinjing, duyiyang, Slimane, FABGYUXIN, chendarcy, Sin, robintan, imhillxtz, navinahc, z, zhangdapao, yixiangRDS500 后续我们将在Hugging Face 🤗 发布的所有数据集版本中持续为贡献者标注署名。 **我们将为符合条件的贡献者提供论文共同作者资格。**
提供机构:
maas
创建时间:
2025-03-15
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作