reasoning-base-20k

Name: reasoning-base-20k
Creator: maas
Published: 2025-12-04 16:17:29
License: 暂无描述

魔搭社区2025-12-04 更新2024-10-12 收录

下载链接：

https://modelscope.cn/datasets/AI-ModelScope/reasoning-base-20k

下载链接

链接失效反馈

官方服务：

资源简介：

# Dataset Card for Reasoning Base 20k ## Dataset Details ### Dataset Description This dataset is designed to train a reasoning model. That can think through complex problems before providing a response, similar to how a human would. The dataset includes a wide range of problems from various domains (science, coding, math, etc.), each with a detailed chain of thought (COT) and the correct answer. The goal is to enable the model to learn and refine its reasoning process, recognize and correct mistakes, and provide high-quality, detailed responses. This dataset is currently in-progress. - **Curated by:** [Nishith Jain](https://huggingface.co/KingNish) - **Language(s) (NLP):** English - **License:** Apache-2.0 - **Chat Template**: RChatML ```python {%- for message in messages %} {{- '<|im_start|>' + message['role'] + '\n' }} {{- message['content'] + eos_token + '\n' }} {%- endfor %} {%- if add_generation_prompt %} {{- '<|im_start|>assistant\n' }} {%- endif %} {%- if add_reasoning_prompt %} {{- '<|im_start|>reasoning\n' }} {%- endif %} ``` ## Uses ### Direct Use - **Model Training**: Train reasoning models to improve their ability to think through complex problems. - **Research**: Study the effectiveness of different reasoning strategies and techniques. ## Dataset Structure ### Data Fields - **user**: The user's query or problem statement. - **assistant**: The correct answer to the problem. - **reasoning**: A detailed, step-by-step reasoning process that explains how to arrive at the correct answer. - **template**: A preapplied RChatML chat template.

# 推理基础20k数据集卡片 ## 数据集详情 ### 数据集描述本数据集旨在训练推理模型，使其能够像人类一样，在生成回复前对复杂问题进行逐步思考推导。本数据集涵盖科学、编程、数学等多个领域的多样化问题，每个问题均附带详细的思维链（Chain of Thought, COT）与正确答案。本数据集的目标是帮助模型学习并优化其推理流程，识别并修正错误，最终生成高质量且详尽的回复。当前本数据集仍处于开发完善阶段。 - **整理方：** [Nishith Jain](https://huggingface.co/KingNish) - **自然语言处理所用语言：** 英语 - **许可证：** Apache-2.0 - **对话模板：** RChatML python {%- for message in messages %} {{- '<|im_start|>' + message['role'] + ' ' }} {{- message['content'] + eos_token + ' ' }} {%- endfor %} {%- if add_generation_prompt %} {{- '<|im_start|>assistant ' }} {%- endif %} {%- if add_reasoning_prompt %} {{- '<|im_start|>reasoning ' }} {%- endif %} ## 用途 ### 直接用途 - **模型训练：** 训练推理模型，提升其处理复杂问题的思考推导能力。 - **研究：** 探究不同推理策略与技术的有效性。 ## 数据集结构 ### 数据字段 - **user：** 用户的查询语句或问题描述 - **assistant：** 问题的正确答案 - **reasoning：** 用于阐释正确答案推导逻辑的详尽分步推理过程 - **template：** 已预应用的RChatML对话模板。

提供机构：

maas

创建时间：

2024-10-07

搜集汇总

数据集介绍

背景与挑战

背景概述

该数据集名为'reasoning-base-20k'，旨在训练推理模型，使其能够像人类一样处理复杂问题。它包含来自科学、编码、数学等多个领域的问题，每个问题均配有详细的思维链和正确答案，以帮助模型学习推理过程、纠正错误并生成高质量响应。数据集由Nishith Jain策划，使用英语，采用Apache-2.0许可证。

以上内容由遇见数据集搜集并总结生成