five

Murphy128/hendrycks_math

收藏
Hugging Face2026-04-15 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/Murphy128/hendrycks_math
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit dataset_info: - config_name: algebra features: - name: problem dtype: string - name: level dtype: string - name: type dtype: string - name: solution dtype: string splits: - name: train num_bytes: 955021 num_examples: 1744 - name: test num_bytes: 648291 num_examples: 1187 download_size: 858300 dataset_size: 1603312 - config_name: counting_and_probability features: - name: problem dtype: string - name: level dtype: string - name: type dtype: string - name: solution dtype: string splits: - name: train num_bytes: 667385 num_examples: 771 - name: test num_bytes: 353803 num_examples: 474 download_size: 504386 dataset_size: 1021188 - config_name: geometry features: - name: problem dtype: string - name: level dtype: string - name: type dtype: string - name: solution dtype: string splits: - name: train num_bytes: 1077241 num_examples: 870 - name: test num_bytes: 523126 num_examples: 479 download_size: 813223 dataset_size: 1600367 - config_name: intermediate_algebra features: - name: problem dtype: string - name: level dtype: string - name: type dtype: string - name: solution dtype: string splits: - name: train num_bytes: 1157476 num_examples: 1295 - name: test num_bytes: 795070 num_examples: 903 download_size: 969951 dataset_size: 1952546 - config_name: number_theory features: - name: problem dtype: string - name: level dtype: string - name: type dtype: string - name: solution dtype: string splits: - name: train num_bytes: 595793 num_examples: 869 - name: test num_bytes: 349455 num_examples: 540 download_size: 490656 dataset_size: 945248 - config_name: prealgebra features: - name: problem dtype: string - name: level dtype: string - name: type dtype: string - name: solution dtype: string splits: - name: train num_bytes: 715611 num_examples: 1205 - name: test num_bytes: 510195 num_examples: 871 download_size: 651355 dataset_size: 1225806 - config_name: precalculus features: - name: problem dtype: string - name: level dtype: string - name: type dtype: string - name: solution dtype: string splits: - name: train num_bytes: 816245 num_examples: 746 - name: test num_bytes: 552893 num_examples: 546 download_size: 595986 dataset_size: 1369138 configs: - config_name: algebra data_files: - split: train path: algebra/train-* - split: test path: algebra/test-* - config_name: counting_and_probability data_files: - split: train path: counting_and_probability/train-* - split: test path: counting_and_probability/test-* - config_name: geometry data_files: - split: train path: geometry/train-* - split: test path: geometry/test-* - config_name: intermediate_algebra data_files: - split: train path: intermediate_algebra/train-* - split: test path: intermediate_algebra/test-* - config_name: number_theory data_files: - split: train path: number_theory/train-* - split: test path: number_theory/test-* - config_name: prealgebra data_files: - split: train path: prealgebra/train-* - split: test path: prealgebra/test-* - config_name: precalculus data_files: - split: train path: precalculus/train-* - split: test path: precalculus/test-* --- ## Dataset Summary MATH dataset from https://github.com/hendrycks/math ### Citation Information ``` @article{hendrycksmath2021, title={Measuring Mathematical Problem Solving With the MATH Dataset}, author={Dan Hendrycks and Collin Burns and Saurav Kadavath and Akul Arora and Steven Basart and Eric Tang and Dawn Song and Jacob Steinhardt}, journal={NeurIPS}, year={2021} } ```

许可证:MIT 数据集信息: - 配置名称:代数(algebra) 特征字段: - 题目(problem):数据类型(dtype)为字符串 - 难度等级(level):数据类型(dtype)为字符串 - 题型(type):数据类型(dtype)为字符串 - 解答(solution):数据类型(dtype)为字符串 数据集划分: - 训练集(train):字节数955021,样本量1744 - 测试集(test):字节数648291,样本量1187 下载大小:858300,数据集总大小:1603312 - 配置名称:计数与概率(counting_and_probability) 特征字段: - 题目(problem):数据类型(dtype)为字符串 - 难度等级(level):数据类型(dtype)为字符串 - 题型(type):数据类型(dtype)为字符串 - 解答(solution):数据类型(dtype)为字符串 数据集划分: - 训练集(train):字节数667385,样本量771 - 测试集(test):字节数353803,样本量474 下载大小:504386,数据集总大小:1021188 - 配置名称:几何学(geometry) 特征字段: - 题目(problem):数据类型(dtype)为字符串 - 难度等级(level):数据类型(dtype)为字符串 - 题型(type):数据类型(dtype)为字符串 - 解答(solution):数据类型(dtype)为字符串 数据集划分: - 训练集(train):字节数1077241,样本量870 - 测试集(test):字节数523126,样本量479 下载大小:813223,数据集总大小:1600367 - 配置名称:中级代数(intermediate_algebra) 特征字段: - 题目(problem):数据类型(dtype)为字符串 - 难度等级(level):数据类型(dtype)为字符串 - 题型(type):数据类型(dtype)为字符串 - 解答(solution):数据类型(dtype)为字符串 数据集划分: - 训练集(train):字节数1157476,样本量1295 - 测试集(test):字节数795070,样本量903 下载大小:969951,数据集总大小:1952546 - 配置名称:数论(number_theory) 特征字段: - 题目(problem):数据类型(dtype)为字符串 - 难度等级(level):数据类型(dtype)为字符串 - 题型(type):数据类型(dtype)为字符串 - 解答(solution):数据类型(dtype)为字符串 数据集划分: - 训练集(train):字节数595793,样本量869 - 测试集(test):字节数349455,样本量540 下载大小:490656,数据集总大小:945248 - 配置名称:初等代数(prealgebra) 特征字段: - 题目(problem):数据类型(dtype)为字符串 - 难度等级(level):数据类型(dtype)为字符串 - 题型(type):数据类型(dtype)为字符串 - 解答(solution):数据类型(dtype)为字符串 数据集划分: - 训练集(train):字节数715611,样本量1205 - 测试集(test):字节数510195,样本量871 下载大小:651355,数据集总大小:1225806 - 配置名称:预备微积分(precalculus) 特征字段: - 题目(problem):数据类型(dtype)为字符串 - 难度等级(level):数据类型(dtype)为字符串 - 题型(type):数据类型(dtype)为字符串 - 解答(solution):数据类型(dtype)为字符串 数据集划分: - 训练集(train):字节数816245,样本量746 - 测试集(test):字节数552893,样本量546 下载大小:595986,数据集总大小:1369138 数据集配置: - 配置名称:代数(algebra):数据文件路径为训练集对应"algebra/train-*",测试集对应"algebra/test-*" - 配置名称:计数与概率(counting_and_probability):训练集对应"counting_and_probability/train-*",测试集对应"counting_and_probability/test-*" - 配置名称:几何学(geometry):训练集对应"geometry/train-*",测试集对应"geometry/test-*" - 配置名称:中级代数(intermediate_algebra):训练集对应"intermediate_algebra/train-*",测试集对应"intermediate_algebra/test-*" - 配置名称:数论(number_theory):训练集对应"number_theory/train-*",测试集对应"number_theory/test-*" - 配置名称:初等代数(prealgebra):训练集对应"prealgebra/train-*",测试集对应"prealgebra/test-*" - 配置名称:预备微积分(precalculus):训练集对应"precalculus/train-*",测试集对应"precalculus/test-*" ## 数据集概述 本数据集为源自https://github.com/hendrycks/math的MATH数据集。 ### 引用信息 @article{hendrycksmath2021, title={使用MATH数据集衡量数学解题能力}, author={Dan Hendrycks、Collin Burns、Saurav Kadavath、Akul Arora、Steven Basart、Eric Tang、Dawn Song、Jacob Steinhardt}, journal={神经信息处理系统大会(NeurIPS)}, year={2021} }
提供机构:
Murphy128
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作