five

OmniThought-0528

收藏
魔搭社区2026-01-09 更新2025-06-28 收录
下载链接:
https://modelscope.cn/datasets/PAI/OmniThought-0528
下载链接
链接失效反馈
官方服务:
资源简介:
# OmniThought-0528 *A High-Quality Chain-of-Thought (CoT) Dataset for Enhanced Model Distillation* ## **Overview** OmniThought-0528 is an advanced version of the **OmniThought** dataset, designed to enhance reasoning capabilities in language models through high-quality **Chain-of-Thought (CoT)** distillation. It consists of **365,000 reasoning chains** across diverse domains, including **mathematics, coding, and science**, generated and rigorously validated using state-of-the-art teacher models. ## **Dataset Construction** ### **Data Sources & Processing** - **Input Problems**: Collected from multiple public sources, covering **mathematical, scientific, and programming reasoning tasks**. - **Teacher Models**: Generated using **DeepSeek-R1** and **QwQ-32B**, then filtered via **"LLM-as-a-Judge"** validation to ensure correctness. - **Quality Control**: Only verified reasoning chains were retained, ensuring high reliability. ### **Annotations & Metrics** Each reasoning chain is annotated with two key metrics: | **Score** | **Reasoning Verbosity (RV)** | **Cognitive Difficulty (CD)** | |-----------|-----------------------------|-------------------------------| | **0-1** | Minimal explanation, direct answers. | Elementary knowledge, single-step reasoning. | | **2-3** | Concise reasoning with necessary steps. | Multi-step arithmetic, rule-based logic. | | **4-5** | Detailed explanations, thorough reasoning. | Basic algebra, non-trivial inference. | | **6-7** | Comprehensive exploration, advanced techniques. | Higher-level methods (e.g., dynamic programming, proofs). | | **8-9** | Deep, exhaustive reasoning with nested arguments. | Highly abstract (e.g., complex algorithm analysis). | These metrics help **match reasoning complexity with model size**—smaller models benefit from simpler chains, while larger models leverage advanced reasoning. ## **Key Features** - **Enhanced Teacher Model**: Generated using **DeepSeek-R1-0528**, a top-tier model (comparable to **GPT-4o & Gemini-2.5-Pro**) excelling in **math, coding, and logic benchmarks**. - **Structured Format**: Each entry includes: ```json { "question": "problem_text", "reasoning": [ { "full_response": "detailed_answer", "Cognitive_Difficulty": score, "Reasoning_Verbosity": score }, ... ] } ```

# OmniThought-0528 *面向模型蒸馏优化的高质量思维链(Chain-of-Thought, CoT)数据集* ## **概述** OmniThought-0528 是 **OmniThought** 数据集的进阶版本,旨在通过高质量思维链(Chain-of-Thought,CoT)蒸馏提升语言模型的推理能力。该数据集涵盖数学、编码与科学等多元领域的365,000条推理链,由当前顶尖的教师模型生成并经过严格验证。 ## **数据集构建** ### **数据来源与处理流程** - **输入问题**:采集自多个公开来源,覆盖数学、科学与编程推理任务。 - **教师模型**:由 DeepSeek-R1 与 QwQ-32B 生成,随后通过「LLM-as-a-Judge」验证流程进行筛选,以确保结果正确性。 - **质量管控**:仅保留经验证的推理链,保障数据集的高可靠性。 ### **标注与评价指标** 每条推理链均配有两项核心指标: | 得分区间 | 推理详细度(Reasoning Verbosity, RV) | 认知难度(Cognitive Difficulty, CD) | |---------|--------------------------------------|--------------------------------------| | 0-1 | 解释极简,直接给出答案 | 基础知识点,单步推理 | | 2-3 | 包含必要步骤的简洁推理 | 多步算术、基于规则的逻辑推理 | | 4-5 | 详细解释,推理过程完整 | 基础代数、非平凡推理 | | 6-7 | 全面探索,运用高级技术 | 高阶方法(如动态规划、证明推导) | | 8-9 | 深度穷尽式推理,包含嵌套论证 | 高度抽象(如复杂算法分析) | 上述指标可用于**匹配推理复杂度与模型规模**——小型模型适配更简单的推理链,而大型模型则可利用高阶推理内容。 ## **核心特性** - **增强型教师模型**:由 DeepSeek-R1-0528 生成,该模型为顶尖模型(可与 GPT-4o 及 Gemini-2.5-Pro 媲美),在数学、编码与逻辑基准测试中表现优异。 - **结构化格式**:每条数据条目包含如下内容: json { "question": "problem_text", "reasoning": [ { "full_response": "detailed_answer", "Cognitive_Difficulty": score, "Reasoning_Verbosity": score }, ... ] }
提供机构:
maas
创建时间:
2025-06-24
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
OmniThought-0528是一个高质量链式思考数据集,旨在通过蒸馏提升语言模型的推理能力。它包含365,000个经过验证的推理链,覆盖数学、编码和科学等多个领域,由先进的教师模型生成并标注了认知难度和推理详细度指标。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作