Murphy128/hendrycks_math
收藏Hugging Face2026-04-15 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/Murphy128/hendrycks_math
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
dataset_info:
- config_name: algebra
features:
- name: problem
dtype: string
- name: level
dtype: string
- name: type
dtype: string
- name: solution
dtype: string
splits:
- name: train
num_bytes: 955021
num_examples: 1744
- name: test
num_bytes: 648291
num_examples: 1187
download_size: 858300
dataset_size: 1603312
- config_name: counting_and_probability
features:
- name: problem
dtype: string
- name: level
dtype: string
- name: type
dtype: string
- name: solution
dtype: string
splits:
- name: train
num_bytes: 667385
num_examples: 771
- name: test
num_bytes: 353803
num_examples: 474
download_size: 504386
dataset_size: 1021188
- config_name: geometry
features:
- name: problem
dtype: string
- name: level
dtype: string
- name: type
dtype: string
- name: solution
dtype: string
splits:
- name: train
num_bytes: 1077241
num_examples: 870
- name: test
num_bytes: 523126
num_examples: 479
download_size: 813223
dataset_size: 1600367
- config_name: intermediate_algebra
features:
- name: problem
dtype: string
- name: level
dtype: string
- name: type
dtype: string
- name: solution
dtype: string
splits:
- name: train
num_bytes: 1157476
num_examples: 1295
- name: test
num_bytes: 795070
num_examples: 903
download_size: 969951
dataset_size: 1952546
- config_name: number_theory
features:
- name: problem
dtype: string
- name: level
dtype: string
- name: type
dtype: string
- name: solution
dtype: string
splits:
- name: train
num_bytes: 595793
num_examples: 869
- name: test
num_bytes: 349455
num_examples: 540
download_size: 490656
dataset_size: 945248
- config_name: prealgebra
features:
- name: problem
dtype: string
- name: level
dtype: string
- name: type
dtype: string
- name: solution
dtype: string
splits:
- name: train
num_bytes: 715611
num_examples: 1205
- name: test
num_bytes: 510195
num_examples: 871
download_size: 651355
dataset_size: 1225806
- config_name: precalculus
features:
- name: problem
dtype: string
- name: level
dtype: string
- name: type
dtype: string
- name: solution
dtype: string
splits:
- name: train
num_bytes: 816245
num_examples: 746
- name: test
num_bytes: 552893
num_examples: 546
download_size: 595986
dataset_size: 1369138
configs:
- config_name: algebra
data_files:
- split: train
path: algebra/train-*
- split: test
path: algebra/test-*
- config_name: counting_and_probability
data_files:
- split: train
path: counting_and_probability/train-*
- split: test
path: counting_and_probability/test-*
- config_name: geometry
data_files:
- split: train
path: geometry/train-*
- split: test
path: geometry/test-*
- config_name: intermediate_algebra
data_files:
- split: train
path: intermediate_algebra/train-*
- split: test
path: intermediate_algebra/test-*
- config_name: number_theory
data_files:
- split: train
path: number_theory/train-*
- split: test
path: number_theory/test-*
- config_name: prealgebra
data_files:
- split: train
path: prealgebra/train-*
- split: test
path: prealgebra/test-*
- config_name: precalculus
data_files:
- split: train
path: precalculus/train-*
- split: test
path: precalculus/test-*
---
## Dataset Summary
MATH dataset from https://github.com/hendrycks/math
### Citation Information
```
@article{hendrycksmath2021,
title={Measuring Mathematical Problem Solving With the MATH Dataset},
author={Dan Hendrycks and Collin Burns and Saurav Kadavath and Akul Arora and Steven Basart and Eric Tang and Dawn Song and Jacob Steinhardt},
journal={NeurIPS},
year={2021}
}
```
许可证:MIT
数据集信息:
- 配置名称:代数(algebra)
特征字段:
- 题目(problem):数据类型(dtype)为字符串
- 难度等级(level):数据类型(dtype)为字符串
- 题型(type):数据类型(dtype)为字符串
- 解答(solution):数据类型(dtype)为字符串
数据集划分:
- 训练集(train):字节数955021,样本量1744
- 测试集(test):字节数648291,样本量1187
下载大小:858300,数据集总大小:1603312
- 配置名称:计数与概率(counting_and_probability)
特征字段:
- 题目(problem):数据类型(dtype)为字符串
- 难度等级(level):数据类型(dtype)为字符串
- 题型(type):数据类型(dtype)为字符串
- 解答(solution):数据类型(dtype)为字符串
数据集划分:
- 训练集(train):字节数667385,样本量771
- 测试集(test):字节数353803,样本量474
下载大小:504386,数据集总大小:1021188
- 配置名称:几何学(geometry)
特征字段:
- 题目(problem):数据类型(dtype)为字符串
- 难度等级(level):数据类型(dtype)为字符串
- 题型(type):数据类型(dtype)为字符串
- 解答(solution):数据类型(dtype)为字符串
数据集划分:
- 训练集(train):字节数1077241,样本量870
- 测试集(test):字节数523126,样本量479
下载大小:813223,数据集总大小:1600367
- 配置名称:中级代数(intermediate_algebra)
特征字段:
- 题目(problem):数据类型(dtype)为字符串
- 难度等级(level):数据类型(dtype)为字符串
- 题型(type):数据类型(dtype)为字符串
- 解答(solution):数据类型(dtype)为字符串
数据集划分:
- 训练集(train):字节数1157476,样本量1295
- 测试集(test):字节数795070,样本量903
下载大小:969951,数据集总大小:1952546
- 配置名称:数论(number_theory)
特征字段:
- 题目(problem):数据类型(dtype)为字符串
- 难度等级(level):数据类型(dtype)为字符串
- 题型(type):数据类型(dtype)为字符串
- 解答(solution):数据类型(dtype)为字符串
数据集划分:
- 训练集(train):字节数595793,样本量869
- 测试集(test):字节数349455,样本量540
下载大小:490656,数据集总大小:945248
- 配置名称:初等代数(prealgebra)
特征字段:
- 题目(problem):数据类型(dtype)为字符串
- 难度等级(level):数据类型(dtype)为字符串
- 题型(type):数据类型(dtype)为字符串
- 解答(solution):数据类型(dtype)为字符串
数据集划分:
- 训练集(train):字节数715611,样本量1205
- 测试集(test):字节数510195,样本量871
下载大小:651355,数据集总大小:1225806
- 配置名称:预备微积分(precalculus)
特征字段:
- 题目(problem):数据类型(dtype)为字符串
- 难度等级(level):数据类型(dtype)为字符串
- 题型(type):数据类型(dtype)为字符串
- 解答(solution):数据类型(dtype)为字符串
数据集划分:
- 训练集(train):字节数816245,样本量746
- 测试集(test):字节数552893,样本量546
下载大小:595986,数据集总大小:1369138
数据集配置:
- 配置名称:代数(algebra):数据文件路径为训练集对应"algebra/train-*",测试集对应"algebra/test-*"
- 配置名称:计数与概率(counting_and_probability):训练集对应"counting_and_probability/train-*",测试集对应"counting_and_probability/test-*"
- 配置名称:几何学(geometry):训练集对应"geometry/train-*",测试集对应"geometry/test-*"
- 配置名称:中级代数(intermediate_algebra):训练集对应"intermediate_algebra/train-*",测试集对应"intermediate_algebra/test-*"
- 配置名称:数论(number_theory):训练集对应"number_theory/train-*",测试集对应"number_theory/test-*"
- 配置名称:初等代数(prealgebra):训练集对应"prealgebra/train-*",测试集对应"prealgebra/test-*"
- 配置名称:预备微积分(precalculus):训练集对应"precalculus/train-*",测试集对应"precalculus/test-*"
## 数据集概述
本数据集为源自https://github.com/hendrycks/math的MATH数据集。
### 引用信息
@article{hendrycksmath2021,
title={使用MATH数据集衡量数学解题能力},
author={Dan Hendrycks、Collin Burns、Saurav Kadavath、Akul Arora、Steven Basart、Eric Tang、Dawn Song、Jacob Steinhardt},
journal={神经信息处理系统大会(NeurIPS)},
year={2021}
}
提供机构:
Murphy128



