Math-IIO-68K-Mini
收藏魔搭社区2025-11-27 更新2025-01-11 收录
下载链接:
https://modelscope.cn/datasets/prithivMLmods/Math-IIO-68K-Mini
下载链接
链接失效反馈官方服务:
资源简介:
# Mathematics Dataset for AI Model Training
This dataset contains **68,000 rows** of mathematical questions and their corresponding solutions. It is designed for training AI models capable of solving mathematical problems or providing step-by-step explanations for a variety of mathematical concepts. The dataset is structured into three columns: **input**, **instruction**, and **output**.
## Dataset Overview
- **Input**: A mathematical question or problem statement (e.g., arithmetic, algebra, calculus, geometry, or mathematical constants).
- **Instruction**: A generic prompt instructing the AI to solve the problem and provide the correct answer.
- **Output**: The solution or response to the input question.
### Example Rows
| **Input** | **Instruction** | **Output** |
|-------------------------------------|------------------------------------------------------------------|-----------------------------------------|
| What is 15 + 32? | Solve the mathematical question and provide the correct answer. | The answer is 47. |
| Find the derivative of x² + 4x. | Solve the mathematical question and provide the correct answer. | The derivative is 2x + 4. |
| What is the area of a circle with radius 7? | Solve the mathematical question and provide the correct answer. | The area is approximately 153.938 square units. |
| Simplify the expression: 3x + 7x. | Solve the mathematical question and provide the correct answer. | The simplified expression is 10x. |
---
## Dataset Features
### Columns
1. **Input**: The mathematical problem or query (string format).
2. **Instruction**: A static instruction prompting the AI to solve the question.
3. **Output**: The solution or explanation provided for the question (string format).
### Problem Categories
The dataset includes the following categories:
- **Arithmetic**: Basic addition, subtraction, multiplication, and division problems.
- **Algebra**: Simplification of expressions, solving equations, etc.
- **Calculus**: Derivatives, integrals, and basic calculus concepts.
- **Geometry**: Area, perimeter, and other geometric calculations.
- **Mathematical Constants**: Queries about constants like π, e, etc.
---
## Applications
This dataset is suitable for:
- Training natural language models for mathematics tutoring or assistance.
- Fine-tuning language models like GPT on mathematical reasoning tasks.
- Building educational chatbots capable of solving math problems.
---
## File Format
The dataset is saved in **CSV format** with the following structure:
| **input** | **instruction** | **output** |
|------------------------|-----------------------------------------------------------|---------------------|
| What is 12 + 35? | Solve the mathematical question and provide the correct answer. | The answer is 47. |
---
# 用于AI模型训练的数学数据集(Mathematics Dataset for AI Model Training)
本数据集包含**68000条**数学题目及对应解答,旨在训练可解决各类数学问题、或为多种数学概念提供分步解释的AI模型。数据集共包含三列:**输入(Input)**、**指令(Instruction)**与**输出(Output)**。
## 数据集概览(Dataset Overview)
- **输入(Input)**:数学题目或问题描述(例如算术、代数、微积分、几何问题或数学常数相关查询)。
- **指令(Instruction)**:通用提示,用于指引AI模型求解问题并给出正确答案。
- **输出(Output)**:针对输入问题的解答或响应内容。
### 示例条目(Example Rows)
| **输入(Input)** | **指令(Instruction)** | **输出(Output)** |
|----------------------------------|------------------------------------------------------------------|-----------------------------------------------|
| 15加32等于多少? | 求解该数学问题并给出正确答案。 | 答案为47。 |
| 求$x^2 + 4x$的导数。 | 求解该数学问题并给出正确答案。 | 导数为$2x + 4$。 |
| 半径为7的圆的面积是多少? | 求解该数学问题并给出正确答案。 | 圆的面积约为153.938平方单位。 |
| 化简表达式:$3x + 7x$。 | 求解该数学问题并给出正确答案。 | 化简后的表达式为$10x$。 |
---
## 数据集特性(Dataset Features)
### 列信息(Columns)
1. **输入(Input)**:数学问题或查询(字符串格式)。
2. **指令(Instruction)**:固定提示,用于引导AI模型求解对应问题。
3. **输出(Output)**:针对该问题的解答或解释(字符串格式)。
### 问题分类(Problem Categories)
本数据集涵盖以下类别:
- **算术(Arithmetic)**:基础加减乘除运算题目。
- **代数(Algebra)**:表达式化简、方程求解等题型。
- **微积分(Calculus)**:导数、积分及基础微积分概念相关题目。
- **几何(Geometry)**:面积、周长及其他几何计算类题目。
- **数学常数(Mathematical Constants)**:针对$pi$、$e$等数学常数的查询。
---
## 应用场景(Applications)
本数据集适用于:
- 训练用于数学辅导或辅助答疑的自然语言模型。
- 针对数学推理任务对GPT等大语言模型(Large Language Model,LLM)进行微调。
- 搭建可求解数学问题的教育聊天机器人。
---
## 文件格式(File Format)
本数据集以**CSV格式**存储,结构如下:
| **输入(Input)** | **指令(Instruction)** | **输出(Output)** |
|--------------------------|------------------------------------------------------------------|-----------------------------------|
| 12加35等于多少? | 求解该数学问题并给出正确答案。 | 答案为47。 |
提供机构:
maas
创建时间:
2025-01-04



