five

HanxuHU/MMMU_filter

收藏
Hugging Face2024-04-06 更新2024-06-11 收录
下载链接:
https://hf-mirror.com/datasets/HanxuHU/MMMU_filter
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: Accounting features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 106541.06666666667 num_examples: 2 download_size: 188911 dataset_size: 106541.06666666667 - config_name: Agriculture features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 119217398.0 num_examples: 30 download_size: 119223107 dataset_size: 119217398.0 - config_name: Architecture_and_Engineering features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 432711.2 num_examples: 18 download_size: 467361 dataset_size: 432711.2 - config_name: Art features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 29934374.0 num_examples: 30 download_size: 29939738 dataset_size: 29934374.0 - config_name: Art_Theory features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 33481398.0 num_examples: 30 download_size: 29783868 dataset_size: 33481398.0 - config_name: Basic_Medical_Science features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 3988243.6333333333 num_examples: 29 download_size: 4093528 dataset_size: 3988243.6333333333 - config_name: Biology features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 7642516.399999999 num_examples: 27 download_size: 8021775 dataset_size: 7642516.399999999 - config_name: Chemistry features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 1366537.8 num_examples: 27 download_size: 1362901 dataset_size: 1366537.8 - config_name: Clinical_Medicine features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 10882324.0 num_examples: 30 download_size: 10888251 dataset_size: 10882324.0 - config_name: Computer_Science features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 1933724.1333333333 num_examples: 28 download_size: 2009738 dataset_size: 1933724.1333333333 - config_name: Design features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 17922960.0 num_examples: 30 download_size: 16227878 dataset_size: 17922960.0 - config_name: Diagnostics_and_Laboratory_Medicine features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 37106073.0 num_examples: 30 download_size: 37089865 dataset_size: 37106073.0 - config_name: Economics features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 644315.3666666667 num_examples: 13 download_size: 927250 dataset_size: 644315.3666666667 - config_name: Electronics features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 641217.0 num_examples: 30 download_size: 644538 dataset_size: 641217.0 - config_name: Energy_and_Power features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 1641775.0 num_examples: 30 download_size: 1646107 dataset_size: 1641775.0 - config_name: Finance features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 35696.36666666667 num_examples: 1 download_size: 31566 dataset_size: 35696.36666666667 - config_name: Geography features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 6448781.533333333 num_examples: 29 download_size: 6611992 dataset_size: 6448781.533333333 - config_name: History features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 8231713.2 num_examples: 28 download_size: 8206800 dataset_size: 8231713.2 - config_name: Literature features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 14240886.0 num_examples: 30 download_size: 14246788 dataset_size: 14240886.0 - config_name: Manage features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 1966381.8 num_examples: 18 download_size: 2083274 dataset_size: 1966381.8 - config_name: Marketing features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 343658.13333333336 num_examples: 7 download_size: 859324 dataset_size: 343658.13333333336 - config_name: Materials features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 1997713.0 num_examples: 26 download_size: 2199364 dataset_size: 1997713.0 - config_name: Math features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 1396152.7 num_examples: 29 download_size: 1435925 dataset_size: 1396152.7 - config_name: Mechanical_Engineering features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 874828.0 num_examples: 30 download_size: 876772 dataset_size: 874828.0 - config_name: Music features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 9359212.0 num_examples: 30 download_size: 9363650 dataset_size: 9359212.0 - config_name: Pharmacy features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 1435395.4 num_examples: 26 download_size: 1330630 dataset_size: 1435395.4 - config_name: Physics features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 1113970.0 num_examples: 30 download_size: 1117086 dataset_size: 1113970.0 - config_name: Psychology features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 3963314.7 num_examples: 27 download_size: 3978658 dataset_size: 3963314.7 - config_name: Public_Health features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 251434.0 num_examples: 5 download_size: 672165 dataset_size: 251434.0 - config_name: Sociology features: - name: id dtype: string - name: question dtype: string - name: options dtype: string - name: explanation dtype: string - name: image_1 dtype: image - name: image_2 dtype: image - name: image_3 dtype: image - name: image_4 dtype: image - name: image_5 dtype: image - name: image_6 dtype: image - name: image_7 dtype: image - name: img_type dtype: string - name: answer dtype: string - name: topic_difficulty dtype: string - name: question_type dtype: string - name: subfield dtype: string splits: - name: validation num_bytes: 17840003.766666666 num_examples: 29 download_size: 17595987 dataset_size: 17840003.766666666 configs: - config_name: Accounting data_files: - split: validation path: Accounting/validation-* - config_name: Agriculture data_files: - split: validation path: Agriculture/validation-* - config_name: Architecture_and_Engineering data_files: - split: validation path: Architecture_and_Engineering/validation-* - config_name: Art data_files: - split: validation path: Art/validation-* - config_name: Art_Theory data_files: - split: validation path: Art_Theory/validation-* - config_name: Basic_Medical_Science data_files: - split: validation path: Basic_Medical_Science/validation-* - config_name: Biology data_files: - split: validation path: Biology/validation-* - config_name: Chemistry data_files: - split: validation path: Chemistry/validation-* - config_name: Clinical_Medicine data_files: - split: validation path: Clinical_Medicine/validation-* - config_name: Computer_Science data_files: - split: validation path: Computer_Science/validation-* - config_name: Design data_files: - split: validation path: Design/validation-* - config_name: Diagnostics_and_Laboratory_Medicine data_files: - split: validation path: Diagnostics_and_Laboratory_Medicine/validation-* - config_name: Economics data_files: - split: validation path: Economics/validation-* - config_name: Electronics data_files: - split: validation path: Electronics/validation-* - config_name: Energy_and_Power data_files: - split: validation path: Energy_and_Power/validation-* - config_name: Finance data_files: - split: validation path: Finance/validation-* - config_name: Geography data_files: - split: validation path: Geography/validation-* - config_name: History data_files: - split: validation path: History/validation-* - config_name: Literature data_files: - split: validation path: Literature/validation-* - config_name: Manage data_files: - split: validation path: Manage/validation-* - config_name: Marketing data_files: - split: validation path: Marketing/validation-* - config_name: Materials data_files: - split: validation path: Materials/validation-* - config_name: Math data_files: - split: validation path: Math/validation-* - config_name: Mechanical_Engineering data_files: - split: validation path: Mechanical_Engineering/validation-* - config_name: Music data_files: - split: validation path: Music/validation-* - config_name: Pharmacy data_files: - split: validation path: Pharmacy/validation-* - config_name: Physics data_files: - split: validation path: Physics/validation-* - config_name: Psychology data_files: - split: validation path: Psychology/validation-* - config_name: Public_Health data_files: - split: validation path: Public_Health/validation-* - config_name: Sociology data_files: - split: validation path: Sociology/validation-* ---

数据集信息: 本数据集涵盖30个细分学科领域的子配置,所有子配置的特征字段结构完全一致,具体配置详情如下: 通用特征字段说明: - 字段名:id,含义:样本唯一标识符,数据类型:字符串(string) - 字段名:question,含义:试题题干,数据类型:字符串(string) - 字段名:options,含义:试题选项,数据类型:字符串(string) - 字段名:explanation,含义:试题解析,数据类型:字符串(string) - 字段名:image_1,含义:配套图片1,数据类型:图像(image) - 字段名:image_2,含义:配套图片2,数据类型:图像(image) - 字段名:image_3,含义:配套图片3,数据类型:图像(image) - 字段名:image_4,含义:配套图片4,数据类型:图像(image) - 字段名:image_5,含义:配套图片5,数据类型:图像(image) - 字段名:image_6,含义:配套图片6,数据类型:图像(image) - 字段名:image_7,含义:配套图片7,数据类型:图像(image) - 字段名:img_type,含义:图片类型,数据类型:字符串(string) - 字段名:answer,含义:正确答案,数据类型:字符串(string) - 字段名:topic_difficulty,含义:试题难度等级,数据类型:字符串(string) - 字段名:question_type,含义:试题题型,数据类型:字符串(string) - 字段名:subfield,含义:学科子领域,数据类型:字符串(string) 各学科子配置详情: - 配置名称:会计学(Accounting) 数据集划分:验证集,字节数:106541.07(近似值),样本数量:2,下载大小:188911,数据集大小:106541.07(近似值) - 配置名称:农学(Agriculture) 数据集划分:验证集,字节数:119217398.0,样本数量:30,下载大小:119223107,数据集大小:119217398.0 - 配置名称:建筑学与工程学(Architecture_and_Engineering) 数据集划分:验证集,字节数:432711.2,样本数量:18,下载大小:467361,数据集大小:432711.2 - 配置名称:艺术学(Art) 数据集划分:验证集,字节数:29934374.0,样本数量:30,下载大小:29939738,数据集大小:29934374.0 - 配置名称:艺术理论(Art_Theory) 数据集划分:验证集,字节数:33481398.0,样本数量:30,下载大小:29783868,数据集大小:33481398.0 - 配置名称:基础医学(Basic_Medical_Science) 数据集划分:验证集,字节数:3988243.63(近似值),样本数量:29,下载大小:4093528,数据集大小:3988243.63(近似值) - 配置名称:生物学(Biology) 数据集划分:验证集,字节数:7642516.40(近似值),样本数量:27,下载大小:8021775,数据集大小:7642516.40(近似值) - 配置名称:化学(Chemistry) 数据集划分:验证集,字节数:1366537.8,样本数量:27,下载大小:1362901,数据集大小:1366537.8 - 配置名称:临床医学(Clinical_Medicine) 数据集划分:验证集,字节数:10882324.0,样本数量:30,下载大小:10888251,数据集大小:10882324.0 - 配置名称:计算机科学(Computer_Science) 数据集划分:验证集,字节数:1933724.13(近似值),样本数量:28,下载大小:2009738,数据集大小:1933724.13(近似值) - 配置名称:设计学(Design) 数据集划分:验证集,字节数:17922960.0,样本数量:30,下载大小:16227878,数据集大小:17922960.0 - 配置名称:诊断学与检验医学(Diagnostics_and_Laboratory_Medicine) 数据集划分:验证集,字节数:37106073.0,样本数量:30,下载大小:37089865,数据集大小:37106073.0 - 配置名称:经济学(Economics) 数据集划分:验证集,字节数:644315.37(近似值),样本数量:13,下载大小:927250,数据集大小:644315.37(近似值) - 配置名称:电子学(Electronics) 数据集划分:验证集,字节数:641217.0,样本数量:30,下载大小:644538,数据集大小:641217.0 - 配置名称:能源与动力工程(Energy_and_Power) 数据集划分:验证集,字节数:1641775.0,样本数量:30,下载大小:1646107,数据集大小:1641775.0 - 配置名称:金融学(Finance) 数据集划分:验证集,字节数:35696.37(近似值),样本数量:1,下载大小:31566,数据集大小:35696.37(近似值) - 配置名称:地理学(Geography) 数据集划分:验证集,字节数:6448781.53(近似值),样本数量:29,下载大小:6611992,数据集大小:6448781.53(近似值) - 配置名称:历史学(History) 数据集划分:验证集,字节数:8231713.2,样本数量:28,下载大小:8206800,数据集大小:8231713.2 - 配置名称:文学(Literature) 数据集划分:验证集,字节数:14240886.0,样本数量:30,下载大小:14246788,数据集大小:14240886.0 - 配置名称:管理学(Manage) 数据集划分:验证集,字节数:1966381.8,样本数量:18,下载大小:2083274,数据集大小:1966381.8 - 配置名称:市场营销学(Marketing) 数据集划分:验证集,字节数:343658.13(近似值),样本数量:7,下载大小:859324,数据集大小:343658.13(近似值) - 配置名称:材料科学(Materials) 数据集划分:验证集,字节数:1997713.0,样本数量:26,下载大小:2199364,数据集大小:1997713.0 - 配置名称:数学(Math) 数据集划分:验证集,字节数:1396152.7,样本数量:29,下载大小:1435925,数据集大小:1396152.7 - 配置名称:机械工程学(Mechanical_Engineering) 数据集划分:验证集,字节数:874828.0,样本数量:30,下载大小:876772,数据集大小:874828.0 - 配置名称:音乐学(Music) 数据集划分:验证集,字节数:9359212.0,样本数量:30,下载大小:9363650,数据集大小:9359212.0 - 配置名称:药学(Pharmacy) 数据集划分:验证集,字节数:1435395.4,样本数量:26,下载大小:1330630,数据集大小:1435395.4 - 配置名称:物理学(Physics) 数据集划分:验证集,字节数:1113970.0,样本数量:30,下载大小:1117086,数据集大小:1113970.0 - 配置名称:心理学(Psychology) 数据集划分:验证集,字节数:3963314.7,样本数量:27,下载大小:3978658,数据集大小:3963314.7 - 配置名称:公共卫生学(Public_Health) 数据集划分:验证集,字节数:251434.0,样本数量:5,下载大小:672165,数据集大小:251434.0 - 配置名称:社会学(Sociology) 数据集划分:验证集,字节数:17840003.77(近似值),样本数量:29,下载大小:17595987,数据集大小:17840003.77(近似值) 数据集配置详情: 所有子配置的数据文件均对应验证集划分,文件路径格式为「[配置英文名称]/validation-*」,具体如下: - 配置名称:会计学(Accounting),数据文件路径:Accounting/validation-* - 配置名称:农学(Agriculture),数据文件路径:Agriculture/validation-* - 配置名称:建筑学与工程学(Architecture_and_Engineering),数据文件路径:Architecture_and_Engineering/validation-* - 配置名称:艺术学(Art),数据文件路径:Art/validation-* - 配置名称:艺术理论(Art_Theory),数据文件路径:Art_Theory/validation-* - 配置名称:基础医学(Basic_Medical_Science),数据文件路径:Basic_Medical_Science/validation-* - 配置名称:生物学(Biology),数据文件路径:Biology/validation-* - 配置名称:化学(Chemistry),数据文件路径:Chemistry/validation-* - 配置名称:临床医学(Clinical_Medicine),数据文件路径:Clinical_Medicine/validation-* - 配置名称:计算机科学(Computer_Science),数据文件路径:Computer_Science/validation-* - 配置名称:设计学(Design),数据文件路径:Design/validation-* - 配置名称:诊断学与检验医学(Diagnostics_and_Laboratory_Medicine),数据文件路径:Diagnostics_and_Laboratory_Medicine/validation-* - 配置名称:经济学(Economics),数据文件路径:Economics/validation-* - 配置名称:电子学(Electronics),数据文件路径:Electronics/validation-* - 配置名称:能源与动力工程(Energy_and_Power),数据文件路径:Energy_and_Power/validation-* - 配置名称:金融学(Finance),数据文件路径:Finance/validation-* - 配置名称:地理学(Geography),数据文件路径:Geography/validation-* - 配置名称:历史学(History),数据文件路径:History/validation-* - 配置名称:文学(Literature),数据文件路径:Literature/validation-* - 配置名称:管理学(Manage),数据文件路径:Manage/validation-* - 配置名称:市场营销学(Marketing),数据文件路径:Marketing/validation-* - 配置名称:材料科学(Materials),数据文件路径:Materials/validation-* - 配置名称:数学(Math),数据文件路径:Math/validation-* - 配置名称:机械工程学(Mechanical_Engineering),数据文件路径:Mechanical_Engineering/validation-* - 配置名称:音乐学(Music),数据文件路径:Music/validation-* - 配置名称:药学(Pharmacy),数据文件路径:Pharmacy/validation-* - 配置名称:物理学(Physics),数据文件路径:Physics/validation-* - 配置名称:心理学(Psychology),数据文件路径:Psychology/validation-* - 配置名称:公共卫生学(Public_Health),数据文件路径:Public_Health/validation-* - 配置名称:社会学(Sociology),数据文件路径:Sociology/validation-*
提供机构:
HanxuHU
原始信息汇总

数据集概述

本数据集包含多个子数据集,每个子数据集对应不同的学科领域,具体包括会计、农业、建筑与工程、艺术、艺术理论、基础医学科学、生物学、化学、临床医学、计算机科学、设计、诊断与实验室医学、经济学、电子学、能源与电力、金融、地理、历史、文学、管理、市场营销、材料科学、数学、机械工程、音乐和药学。

数据集特征

每个子数据集包含以下特征:

  • id: 数据标识符,类型为字符串。
  • question: 问题描述,类型为字符串。
  • options: 选项,类型为字符串。
  • explanation: 解释,类型为字符串。
  • image_1image_7: 图像文件,类型为图像。
  • img_type: 图像类型,类型为字符串。
  • answer: 答案,类型为字符串。
  • topic_difficulty: 主题难度,类型为字符串。
  • question_type: 问题类型,类型为字符串。
  • subfield: 子领域,类型为字符串。

数据集分割

每个子数据集都包含一个名为validation的分割,该分割提供了数据集的大小(以字节为单位)和示例数量。

数据集大小和下载大小

每个子数据集的validation分割提供了数据集的总大小(以字节为单位)和下载大小(以字节为单位)。这些信息有助于用户了解数据集的存储和传输需求。

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作