introvoyz041/Turing-Reason-CoT

Name: introvoyz041/Turing-Reason-CoT
Creator: introvoyz041
Published: 2026-02-23 07:52:42
License: 暂无描述

Hugging Face2026-02-23 更新2026-03-29 收录

下载链接：

https://hf-mirror.com/datasets/introvoyz041/Turing-Reason-CoT

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: apache-2.0 task_categories: - text-generation language: - en tags: - code - math - CoT - synthetic - biology - medical size_categories: - 1M<n<10M --- ![2.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/O9MWdf7m8nBfw9E0JaspD.png) # **Turing-Reason-CoT** > **Turing-Reason-CoT** is a high-quality, compact chain-of-thought reasoning dataset curated for tasks in mathematics, science, and coding. While the dataset spans diverse domains, it is primarily driven by mathematical reasoning, reflecting a major share of math-focused prompts and long-form logical solutions. ## Quick Start with Hugging Face Datasets🤗 ```py pip install -U datasets ``` ```py from datasets import load_dataset dataset = load_dataset("prithivMLmods/Turing-Reason-CoT", split="train") ``` ## Overview * **Total Samples**: \~ 4,993,463 * **Split**: `train` only * **Languages**: English * **Format**: Apache Arrow (auto-converted to Parquet) * **License**: Apache-2.0 * **Tags**: `math`, `code`, `science`, `reasoning`, `longcot` ## Highlights * Structured to promote **long-form, step-by-step reasoning**, ideal for training and evaluating chain-of-thought (CoT) capable models. * Reasoning traces include natural, human-like explanations for both simple and complex problems. * Fine-tuned across math word problems, logic-based questions, and technical prompts from STEM domains. ## Dataset Structure Each entry in the dataset includes: * **`problem`** (string): A math, science, or code problem. * **`solution`** (string): A detailed step-by-step solution crafted in a long-form reasoning style. The reasoning structure in solutions helps models understand logical flow, intermediate steps, and layered deductions—making this dataset suitable for advanced LLMs requiring interpretable outputs. ## Source & Derivation **Turing-Reason-CoT** is a derivative collection synthesized and optimized from: * A custom internal modular dataset tailored for logical and numeric reasoning tasks. * Chain-of-thought style responses generated with QwQ 32B-based models, carefully filtered and structured for quality. * Math chain-of-thoughts from PrimeIntellect/NuminaMath-QwQ-CoT-5M. This dataset was created with a focus on enhancing CoT capabilities in large-scale models working on math, science, and code. ## License Apache License 2.0

提供机构：

introvoyz041

5,000+

优质数据集

54 个

任务类型

进入经典数据集