five

Sidsidney/general-reasoning-ift-pairs

收藏
Hugging Face2025-12-14 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/Sidsidney/general-reasoning-ift-pairs
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: prompt dtype: string - name: reasoning dtype: string - name: ift dtype: string splits: - name: reasoning_ift_pairs num_bytes: 12536466590 num_examples: 990098 - name: reasoning num_bytes: 10814569775 num_examples: 990098 - name: ift num_bytes: 2274571843 num_examples: 990098 download_size: 12132970012 dataset_size: 25625608208 configs: - config_name: default data_files: - split: reasoning_ift_pairs path: data/reasoning_ift_pairs-* - split: reasoning path: data/reasoning-* - split: ift path: data/ift-* license: mit task_categories: - question-answering - text-generation language: - en tags: - instruction-tuning - reasoning - synthetic pretty_name: IFT & Reasoning Paired Dataset size_categories: - 1M<n<10M --- # Reasoning-IFT Pairs (General Domain) <p align="left"> <img src="https://cdn-avatars.huggingface.co/v1/production/uploads/62be186a5f59ff2320e6e32b/GjJ15tY7-F4bqR96FN4pd.png" alt="Dataset Icon" width="180"/> </p> <p align="left"> <a href="https://arxiv.org/pdf/2509.22193" target="_blank" rel="noopener noreferrer"> <img src="https://img.shields.io/badge/arXiv-2509.22193-b31b1b.svg?style=for-the-badge" alt="arXiv:2509.22193" /> </a> </p> This dataset provides **the largest set of IFT and Reasoning answers pairs** for a set of general domain queries (cf: [math-domain](https://huggingface.co/datasets/When-Does-Reasoning-Matter/math-reasoning-ift-pairs)).<br> It is based on the `Infinity-Instruct` dataset, an extensive and high-quality collection of instruction fine-tuning data. We curated **900k queries** from the `7M_core` subset of Infinity-Instruct, which covers multiple domains including general knowledge, commonsense Q&A, coding, and math. For each query, we used [Qwen/Qwen3-235B-A22B](https://huggingface.co/Qwen/Qwen3-235B-A22B), which supports a configurable reasoning flag, to generate two answer formats: - **IFT Answer** → concise, direct response - **Reasoning Answer** → response with reasoning mode enabled (chain-of-thought style) If you use this dataset in your work, please cite: **[When Does Reasoning Matter?](https://arxiv.org/pdf/2509.22193)** ```bibtex @misc{boizard2025doesreasoningmattercontrolled, title={When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance}, author={Nicolas Boizard and Hippolyte Gisserot-Boukhlef and Kevin El-Haddad and Céline Hudelot and Pierre Colombo}, year={2025}, eprint={2509.22193}, archivePrefix={arXiv}, primaryClass={cs.CL}, url={https://arxiv.org/abs/2509.22193}, } ``` --- ## 📂 Dataset Details - **Source**: Based on *Infinity-Instruct* (`7M_core` subset) - **Size**: ~900k query–answer pairs - **Format**: Each entry contains: - `prompt`: input question - `reasoning`: synthetic answer with reasoning enabled - `ift`: synthetic answer without reasoning - **Model used for generation**: `Qwen/Qwen3-235B-A22B` (open-weight, mixture-of-experts, reasoning toggle) --- ## 🎯 Research Motivation Frontier research initiatives highlight the potential of reasoning models, but progress is often confounded by opaque data mixtures and shifting supervision schemes. This dataset moves the needle by isolating reasoning itself: - Using a single teacher model to generate paired IFT and reasoning answers for the same queries, we enable clean attribution of performance improvements specifically to reasoning. - This controlled setup avoids reliance on expensive RL pipelines (e.g. Magistral, Qwen3). - It facilitates systematic study across model scales and data domains.

dataset_info: 数据集信息 features: 特征项 - name: 提示词(prompt) dtype: 字符串(string) - name: 推理过程(reasoning) dtype: 字符串(string) - name: 指令微调样本(ift, Instruction Fine-Tuning) dtype: 字符串(string) splits: 数据划分 - name: 推理-指令微调配对样本(reasoning_ift_pairs) num_bytes: 字节数:12536466590 num_examples: 样本数量:990098 - name: 仅推理样本(reasoning) num_bytes: 字节数:10814569775 num_examples: 样本数量:990098 - name: 仅指令微调样本(ift) num_bytes: 字节数:2274571843 num_examples: 样本数量:990098 download_size: 下载大小:12132970012 dataset_size: 数据集总大小:25625608208 configs: 配置项 - config_name: 默认配置(default) data_files: 数据文件 - split: 推理-指令微调配对样本(reasoning_ift_pairs) path: data/reasoning_ift_pairs-* - split: 仅推理样本(reasoning) path: data/reasoning-* - split: 仅指令微调样本(ift) path: data/ift-* license: 许可证:MIT许可证 task_categories: 任务类别 - 问答(question-answering) - 文本生成(text-generation) language: 语言:英语(en) tags: 标签 - 指令微调(instruction-tuning) - 推理(reasoning) - 合成数据集(synthetic) pretty_name: 易读名称:IFT与推理配对数据集(IFT & Reasoning Paired Dataset) size_categories: 样本规模类别:100万<样本数<1000万 # 通用领域推理-指令微调配对数据集 <p align="left"> <img src="https://cdn-avatars.huggingface.co/v1/production/uploads/62be186a5f59ff2320e6e32b/GjJ15tY7-F4bqR96FN4pd.png" alt="数据集图标" width="180"/> </p> <p align="left"> <a href="https://arxiv.org/pdf/2509.22193" target="_blank" rel="noopener noreferrer"> <img src="https://img.shields.io/badge/arXiv-2509.22193-b31b1b.svg?style=for-the-badge" alt="arXiv:2509.22193" /> </a> </p> 本数据集为通用领域查询集提供了**目前规模最大的指令微调(IFT)与推理答案配对样本集**(可对比参考数学领域数据集:[math-domain](https://huggingface.co/datasets/When-Does-Reasoning-Matter/math-reasoning-ift-pairs))。 本数据集基于`Infinity-Instruct`数据集构建,后者是一个规模庞大、质量上乘的指令微调数据合集。 我们从`Infinity-Instruct`的`7M_core`子集中精选了**90万个查询样本**,该子集覆盖多个领域,包括常识知识、常识问答、编程与数学等。 针对每个查询,我们使用[Qwen/Qwen3-235B-A22B](https://huggingface.co/Qwen/Qwen3-235B-A22B)生成两种格式的答案,该模型支持可配置的推理开关: - **指令微调答案**:简洁直接的应答内容 - **推理答案**:启用推理模式生成的应答(即思维链(Chain-of-Thought, CoT)风格) 若您在研究中使用本数据集,请引用:**[《推理何时发挥作用?》(When Does Reasoning Matter?)](https://arxiv.org/pdf/2509.22193)** bibtex @misc{boizard2025doesreasoningmattercontrolled, title={When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance}, author={Nicolas Boizard and Hippolyte Gisserot-Boukhlef and Kevin El-Haddad and Céline Hudelot and Pierre Colombo}, year={2025}, eprint={2509.22193}, archivePrefix={arXiv}, primaryClass={cs.CL}, url={https://arxiv.org/abs/2509.22193}, } --- ## 📂 数据集详情 - **数据来源**:基于*Infinity-Instruct*的`7M_core`子集 - **数据集规模**:约90万个查询-应答配对样本 - **数据格式**:每条样本包含以下字段: - `提示词(prompt)`:输入问题 - `推理过程(reasoning)`:启用推理模式生成的合成应答 - `指令微调样本(ift)`:未启用推理模式生成的合成应答 - **生成所用模型**:`Qwen/Qwen3-235B-A22B`(开源权重、混合专家架构、支持推理开关) --- ## 🎯 研究动机 前沿研究成果凸显了推理模型的应用潜力,但现有进展常因不透明的数据集混合方式与多变的监督机制而难以归因。 本数据集通过将推理本身作为独立变量,突破了这一研究瓶颈: 1. 使用单一教师模型为同一查询生成配对的指令微调答案与推理答案,可精准将模型性能提升归因于推理模块的作用。 2. 这种可控的实验设置无需依赖成本高昂的强化学习流水线(如Magistral、Qwen3系列模型)。 3. 可支持针对不同模型规模与数据领域的系统性研究。
提供机构:
Sidsidney
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作