OmniThought-0528

Name: OmniThought-0528
Creator: maas
Published: 2026-01-09 11:27:48
License: 暂无描述

魔搭社区2026-01-09 更新2025-06-28 收录

下载链接：

https://modelscope.cn/datasets/PAI/OmniThought-0528

下载链接

链接失效反馈

官方服务：

资源简介：

# OmniThought-0528 *A High-Quality Chain-of-Thought (CoT) Dataset for Enhanced Model Distillation* ## **Overview** OmniThought-0528 is an advanced version of the **OmniThought** dataset, designed to enhance reasoning capabilities in language models through high-quality **Chain-of-Thought (CoT)** distillation. It consists of **365,000 reasoning chains** across diverse domains, including **mathematics, coding, and science**, generated and rigorously validated using state-of-the-art teacher models. ## **Dataset Construction** ### **Data Sources & Processing** - **Input Problems**: Collected from multiple public sources, covering **mathematical, scientific, and programming reasoning tasks**. - **Teacher Models**: Generated using **DeepSeek-R1** and **QwQ-32B**, then filtered via **"LLM-as-a-Judge"** validation to ensure correctness. - **Quality Control**: Only verified reasoning chains were retained, ensuring high reliability. ### **Annotations & Metrics** Each reasoning chain is annotated with two key metrics: | **Score** | **Reasoning Verbosity (RV)** | **Cognitive Difficulty (CD)** | |-----------|-----------------------------|-------------------------------| | **0-1** | Minimal explanation, direct answers. | Elementary knowledge, single-step reasoning. | | **2-3** | Concise reasoning with necessary steps. | Multi-step arithmetic, rule-based logic. | | **4-5** | Detailed explanations, thorough reasoning. | Basic algebra, non-trivial inference. | | **6-7** | Comprehensive exploration, advanced techniques. | Higher-level methods (e.g., dynamic programming, proofs). | | **8-9** | Deep, exhaustive reasoning with nested arguments. | Highly abstract (e.g., complex algorithm analysis). | These metrics help **match reasoning complexity with model size**—smaller models benefit from simpler chains, while larger models leverage advanced reasoning. ## **Key Features** - **Enhanced Teacher Model**: Generated using **DeepSeek-R1-0528**, a top-tier model (comparable to **GPT-4o & Gemini-2.5-Pro**) excelling in **math, coding, and logic benchmarks**. - **Structured Format**: Each entry includes: ```json { "question": "problem_text", "reasoning": [ { "full_response": "detailed_answer", "Cognitive_Difficulty": score, "Reasoning_Verbosity": score }, ... ] } ```

# OmniThought-0528 *面向模型蒸馏优化的高质量思维链（Chain-of-Thought, CoT）数据集* ## **概述** OmniThought-0528 是 **OmniThought** 数据集的进阶版本，旨在通过高质量思维链（Chain-of-Thought，CoT）蒸馏提升语言模型的推理能力。该数据集涵盖数学、编码与科学等多元领域的365,000条推理链，由当前顶尖的教师模型生成并经过严格验证。 ## **数据集构建** ### **数据来源与处理流程** - **输入问题**：采集自多个公开来源，覆盖数学、科学与编程推理任务。 - **教师模型**：由 DeepSeek-R1 与 QwQ-32B 生成，随后通过「LLM-as-a-Judge」验证流程进行筛选，以确保结果正确性。 - **质量管控**：仅保留经验证的推理链，保障数据集的高可靠性。 ### **标注与评价指标** 每条推理链均配有两项核心指标： | 得分区间 | 推理详细度（Reasoning Verbosity, RV） | 认知难度（Cognitive Difficulty, CD） | |---------|--------------------------------------|--------------------------------------| | 0-1 | 解释极简，直接给出答案 | 基础知识点，单步推理 | | 2-3 | 包含必要步骤的简洁推理 | 多步算术、基于规则的逻辑推理 | | 4-5 | 详细解释，推理过程完整 | 基础代数、非平凡推理 | | 6-7 | 全面探索，运用高级技术 | 高阶方法（如动态规划、证明推导） | | 8-9 | 深度穷尽式推理，包含嵌套论证 | 高度抽象（如复杂算法分析） | 上述指标可用于**匹配推理复杂度与模型规模**——小型模型适配更简单的推理链，而大型模型则可利用高阶推理内容。 ## **核心特性** - **增强型教师模型**：由 DeepSeek-R1-0528 生成，该模型为顶尖模型（可与 GPT-4o 及 Gemini-2.5-Pro 媲美），在数学、编码与逻辑基准测试中表现优异。 - **结构化格式**：每条数据条目包含如下内容： json { "question": "problem_text", "reasoning": [ { "full_response": "detailed_answer", "Cognitive_Difficulty": score, "Reasoning_Verbosity": score }, ... ] }

提供机构：

maas

创建时间：

2025-06-24

搜集汇总

数据集介绍