HanxuHU/MMMU_filter
收藏Hugging Face2024-04-06 更新2024-06-11 收录
下载链接:
https://hf-mirror.com/datasets/HanxuHU/MMMU_filter
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: Accounting
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 106541.06666666667
num_examples: 2
download_size: 188911
dataset_size: 106541.06666666667
- config_name: Agriculture
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 119217398.0
num_examples: 30
download_size: 119223107
dataset_size: 119217398.0
- config_name: Architecture_and_Engineering
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 432711.2
num_examples: 18
download_size: 467361
dataset_size: 432711.2
- config_name: Art
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 29934374.0
num_examples: 30
download_size: 29939738
dataset_size: 29934374.0
- config_name: Art_Theory
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 33481398.0
num_examples: 30
download_size: 29783868
dataset_size: 33481398.0
- config_name: Basic_Medical_Science
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 3988243.6333333333
num_examples: 29
download_size: 4093528
dataset_size: 3988243.6333333333
- config_name: Biology
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 7642516.399999999
num_examples: 27
download_size: 8021775
dataset_size: 7642516.399999999
- config_name: Chemistry
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 1366537.8
num_examples: 27
download_size: 1362901
dataset_size: 1366537.8
- config_name: Clinical_Medicine
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 10882324.0
num_examples: 30
download_size: 10888251
dataset_size: 10882324.0
- config_name: Computer_Science
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 1933724.1333333333
num_examples: 28
download_size: 2009738
dataset_size: 1933724.1333333333
- config_name: Design
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 17922960.0
num_examples: 30
download_size: 16227878
dataset_size: 17922960.0
- config_name: Diagnostics_and_Laboratory_Medicine
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 37106073.0
num_examples: 30
download_size: 37089865
dataset_size: 37106073.0
- config_name: Economics
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 644315.3666666667
num_examples: 13
download_size: 927250
dataset_size: 644315.3666666667
- config_name: Electronics
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 641217.0
num_examples: 30
download_size: 644538
dataset_size: 641217.0
- config_name: Energy_and_Power
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 1641775.0
num_examples: 30
download_size: 1646107
dataset_size: 1641775.0
- config_name: Finance
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 35696.36666666667
num_examples: 1
download_size: 31566
dataset_size: 35696.36666666667
- config_name: Geography
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 6448781.533333333
num_examples: 29
download_size: 6611992
dataset_size: 6448781.533333333
- config_name: History
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 8231713.2
num_examples: 28
download_size: 8206800
dataset_size: 8231713.2
- config_name: Literature
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 14240886.0
num_examples: 30
download_size: 14246788
dataset_size: 14240886.0
- config_name: Manage
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 1966381.8
num_examples: 18
download_size: 2083274
dataset_size: 1966381.8
- config_name: Marketing
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 343658.13333333336
num_examples: 7
download_size: 859324
dataset_size: 343658.13333333336
- config_name: Materials
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 1997713.0
num_examples: 26
download_size: 2199364
dataset_size: 1997713.0
- config_name: Math
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 1396152.7
num_examples: 29
download_size: 1435925
dataset_size: 1396152.7
- config_name: Mechanical_Engineering
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 874828.0
num_examples: 30
download_size: 876772
dataset_size: 874828.0
- config_name: Music
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 9359212.0
num_examples: 30
download_size: 9363650
dataset_size: 9359212.0
- config_name: Pharmacy
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 1435395.4
num_examples: 26
download_size: 1330630
dataset_size: 1435395.4
- config_name: Physics
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 1113970.0
num_examples: 30
download_size: 1117086
dataset_size: 1113970.0
- config_name: Psychology
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 3963314.7
num_examples: 27
download_size: 3978658
dataset_size: 3963314.7
- config_name: Public_Health
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 251434.0
num_examples: 5
download_size: 672165
dataset_size: 251434.0
- config_name: Sociology
features:
- name: id
dtype: string
- name: question
dtype: string
- name: options
dtype: string
- name: explanation
dtype: string
- name: image_1
dtype: image
- name: image_2
dtype: image
- name: image_3
dtype: image
- name: image_4
dtype: image
- name: image_5
dtype: image
- name: image_6
dtype: image
- name: image_7
dtype: image
- name: img_type
dtype: string
- name: answer
dtype: string
- name: topic_difficulty
dtype: string
- name: question_type
dtype: string
- name: subfield
dtype: string
splits:
- name: validation
num_bytes: 17840003.766666666
num_examples: 29
download_size: 17595987
dataset_size: 17840003.766666666
configs:
- config_name: Accounting
data_files:
- split: validation
path: Accounting/validation-*
- config_name: Agriculture
data_files:
- split: validation
path: Agriculture/validation-*
- config_name: Architecture_and_Engineering
data_files:
- split: validation
path: Architecture_and_Engineering/validation-*
- config_name: Art
data_files:
- split: validation
path: Art/validation-*
- config_name: Art_Theory
data_files:
- split: validation
path: Art_Theory/validation-*
- config_name: Basic_Medical_Science
data_files:
- split: validation
path: Basic_Medical_Science/validation-*
- config_name: Biology
data_files:
- split: validation
path: Biology/validation-*
- config_name: Chemistry
data_files:
- split: validation
path: Chemistry/validation-*
- config_name: Clinical_Medicine
data_files:
- split: validation
path: Clinical_Medicine/validation-*
- config_name: Computer_Science
data_files:
- split: validation
path: Computer_Science/validation-*
- config_name: Design
data_files:
- split: validation
path: Design/validation-*
- config_name: Diagnostics_and_Laboratory_Medicine
data_files:
- split: validation
path: Diagnostics_and_Laboratory_Medicine/validation-*
- config_name: Economics
data_files:
- split: validation
path: Economics/validation-*
- config_name: Electronics
data_files:
- split: validation
path: Electronics/validation-*
- config_name: Energy_and_Power
data_files:
- split: validation
path: Energy_and_Power/validation-*
- config_name: Finance
data_files:
- split: validation
path: Finance/validation-*
- config_name: Geography
data_files:
- split: validation
path: Geography/validation-*
- config_name: History
data_files:
- split: validation
path: History/validation-*
- config_name: Literature
data_files:
- split: validation
path: Literature/validation-*
- config_name: Manage
data_files:
- split: validation
path: Manage/validation-*
- config_name: Marketing
data_files:
- split: validation
path: Marketing/validation-*
- config_name: Materials
data_files:
- split: validation
path: Materials/validation-*
- config_name: Math
data_files:
- split: validation
path: Math/validation-*
- config_name: Mechanical_Engineering
data_files:
- split: validation
path: Mechanical_Engineering/validation-*
- config_name: Music
data_files:
- split: validation
path: Music/validation-*
- config_name: Pharmacy
data_files:
- split: validation
path: Pharmacy/validation-*
- config_name: Physics
data_files:
- split: validation
path: Physics/validation-*
- config_name: Psychology
data_files:
- split: validation
path: Psychology/validation-*
- config_name: Public_Health
data_files:
- split: validation
path: Public_Health/validation-*
- config_name: Sociology
data_files:
- split: validation
path: Sociology/validation-*
---
数据集信息:
本数据集涵盖30个细分学科领域的子配置,所有子配置的特征字段结构完全一致,具体配置详情如下:
通用特征字段说明:
- 字段名:id,含义:样本唯一标识符,数据类型:字符串(string)
- 字段名:question,含义:试题题干,数据类型:字符串(string)
- 字段名:options,含义:试题选项,数据类型:字符串(string)
- 字段名:explanation,含义:试题解析,数据类型:字符串(string)
- 字段名:image_1,含义:配套图片1,数据类型:图像(image)
- 字段名:image_2,含义:配套图片2,数据类型:图像(image)
- 字段名:image_3,含义:配套图片3,数据类型:图像(image)
- 字段名:image_4,含义:配套图片4,数据类型:图像(image)
- 字段名:image_5,含义:配套图片5,数据类型:图像(image)
- 字段名:image_6,含义:配套图片6,数据类型:图像(image)
- 字段名:image_7,含义:配套图片7,数据类型:图像(image)
- 字段名:img_type,含义:图片类型,数据类型:字符串(string)
- 字段名:answer,含义:正确答案,数据类型:字符串(string)
- 字段名:topic_difficulty,含义:试题难度等级,数据类型:字符串(string)
- 字段名:question_type,含义:试题题型,数据类型:字符串(string)
- 字段名:subfield,含义:学科子领域,数据类型:字符串(string)
各学科子配置详情:
- 配置名称:会计学(Accounting)
数据集划分:验证集,字节数:106541.07(近似值),样本数量:2,下载大小:188911,数据集大小:106541.07(近似值)
- 配置名称:农学(Agriculture)
数据集划分:验证集,字节数:119217398.0,样本数量:30,下载大小:119223107,数据集大小:119217398.0
- 配置名称:建筑学与工程学(Architecture_and_Engineering)
数据集划分:验证集,字节数:432711.2,样本数量:18,下载大小:467361,数据集大小:432711.2
- 配置名称:艺术学(Art)
数据集划分:验证集,字节数:29934374.0,样本数量:30,下载大小:29939738,数据集大小:29934374.0
- 配置名称:艺术理论(Art_Theory)
数据集划分:验证集,字节数:33481398.0,样本数量:30,下载大小:29783868,数据集大小:33481398.0
- 配置名称:基础医学(Basic_Medical_Science)
数据集划分:验证集,字节数:3988243.63(近似值),样本数量:29,下载大小:4093528,数据集大小:3988243.63(近似值)
- 配置名称:生物学(Biology)
数据集划分:验证集,字节数:7642516.40(近似值),样本数量:27,下载大小:8021775,数据集大小:7642516.40(近似值)
- 配置名称:化学(Chemistry)
数据集划分:验证集,字节数:1366537.8,样本数量:27,下载大小:1362901,数据集大小:1366537.8
- 配置名称:临床医学(Clinical_Medicine)
数据集划分:验证集,字节数:10882324.0,样本数量:30,下载大小:10888251,数据集大小:10882324.0
- 配置名称:计算机科学(Computer_Science)
数据集划分:验证集,字节数:1933724.13(近似值),样本数量:28,下载大小:2009738,数据集大小:1933724.13(近似值)
- 配置名称:设计学(Design)
数据集划分:验证集,字节数:17922960.0,样本数量:30,下载大小:16227878,数据集大小:17922960.0
- 配置名称:诊断学与检验医学(Diagnostics_and_Laboratory_Medicine)
数据集划分:验证集,字节数:37106073.0,样本数量:30,下载大小:37089865,数据集大小:37106073.0
- 配置名称:经济学(Economics)
数据集划分:验证集,字节数:644315.37(近似值),样本数量:13,下载大小:927250,数据集大小:644315.37(近似值)
- 配置名称:电子学(Electronics)
数据集划分:验证集,字节数:641217.0,样本数量:30,下载大小:644538,数据集大小:641217.0
- 配置名称:能源与动力工程(Energy_and_Power)
数据集划分:验证集,字节数:1641775.0,样本数量:30,下载大小:1646107,数据集大小:1641775.0
- 配置名称:金融学(Finance)
数据集划分:验证集,字节数:35696.37(近似值),样本数量:1,下载大小:31566,数据集大小:35696.37(近似值)
- 配置名称:地理学(Geography)
数据集划分:验证集,字节数:6448781.53(近似值),样本数量:29,下载大小:6611992,数据集大小:6448781.53(近似值)
- 配置名称:历史学(History)
数据集划分:验证集,字节数:8231713.2,样本数量:28,下载大小:8206800,数据集大小:8231713.2
- 配置名称:文学(Literature)
数据集划分:验证集,字节数:14240886.0,样本数量:30,下载大小:14246788,数据集大小:14240886.0
- 配置名称:管理学(Manage)
数据集划分:验证集,字节数:1966381.8,样本数量:18,下载大小:2083274,数据集大小:1966381.8
- 配置名称:市场营销学(Marketing)
数据集划分:验证集,字节数:343658.13(近似值),样本数量:7,下载大小:859324,数据集大小:343658.13(近似值)
- 配置名称:材料科学(Materials)
数据集划分:验证集,字节数:1997713.0,样本数量:26,下载大小:2199364,数据集大小:1997713.0
- 配置名称:数学(Math)
数据集划分:验证集,字节数:1396152.7,样本数量:29,下载大小:1435925,数据集大小:1396152.7
- 配置名称:机械工程学(Mechanical_Engineering)
数据集划分:验证集,字节数:874828.0,样本数量:30,下载大小:876772,数据集大小:874828.0
- 配置名称:音乐学(Music)
数据集划分:验证集,字节数:9359212.0,样本数量:30,下载大小:9363650,数据集大小:9359212.0
- 配置名称:药学(Pharmacy)
数据集划分:验证集,字节数:1435395.4,样本数量:26,下载大小:1330630,数据集大小:1435395.4
- 配置名称:物理学(Physics)
数据集划分:验证集,字节数:1113970.0,样本数量:30,下载大小:1117086,数据集大小:1113970.0
- 配置名称:心理学(Psychology)
数据集划分:验证集,字节数:3963314.7,样本数量:27,下载大小:3978658,数据集大小:3963314.7
- 配置名称:公共卫生学(Public_Health)
数据集划分:验证集,字节数:251434.0,样本数量:5,下载大小:672165,数据集大小:251434.0
- 配置名称:社会学(Sociology)
数据集划分:验证集,字节数:17840003.77(近似值),样本数量:29,下载大小:17595987,数据集大小:17840003.77(近似值)
数据集配置详情:
所有子配置的数据文件均对应验证集划分,文件路径格式为「[配置英文名称]/validation-*」,具体如下:
- 配置名称:会计学(Accounting),数据文件路径:Accounting/validation-*
- 配置名称:农学(Agriculture),数据文件路径:Agriculture/validation-*
- 配置名称:建筑学与工程学(Architecture_and_Engineering),数据文件路径:Architecture_and_Engineering/validation-*
- 配置名称:艺术学(Art),数据文件路径:Art/validation-*
- 配置名称:艺术理论(Art_Theory),数据文件路径:Art_Theory/validation-*
- 配置名称:基础医学(Basic_Medical_Science),数据文件路径:Basic_Medical_Science/validation-*
- 配置名称:生物学(Biology),数据文件路径:Biology/validation-*
- 配置名称:化学(Chemistry),数据文件路径:Chemistry/validation-*
- 配置名称:临床医学(Clinical_Medicine),数据文件路径:Clinical_Medicine/validation-*
- 配置名称:计算机科学(Computer_Science),数据文件路径:Computer_Science/validation-*
- 配置名称:设计学(Design),数据文件路径:Design/validation-*
- 配置名称:诊断学与检验医学(Diagnostics_and_Laboratory_Medicine),数据文件路径:Diagnostics_and_Laboratory_Medicine/validation-*
- 配置名称:经济学(Economics),数据文件路径:Economics/validation-*
- 配置名称:电子学(Electronics),数据文件路径:Electronics/validation-*
- 配置名称:能源与动力工程(Energy_and_Power),数据文件路径:Energy_and_Power/validation-*
- 配置名称:金融学(Finance),数据文件路径:Finance/validation-*
- 配置名称:地理学(Geography),数据文件路径:Geography/validation-*
- 配置名称:历史学(History),数据文件路径:History/validation-*
- 配置名称:文学(Literature),数据文件路径:Literature/validation-*
- 配置名称:管理学(Manage),数据文件路径:Manage/validation-*
- 配置名称:市场营销学(Marketing),数据文件路径:Marketing/validation-*
- 配置名称:材料科学(Materials),数据文件路径:Materials/validation-*
- 配置名称:数学(Math),数据文件路径:Math/validation-*
- 配置名称:机械工程学(Mechanical_Engineering),数据文件路径:Mechanical_Engineering/validation-*
- 配置名称:音乐学(Music),数据文件路径:Music/validation-*
- 配置名称:药学(Pharmacy),数据文件路径:Pharmacy/validation-*
- 配置名称:物理学(Physics),数据文件路径:Physics/validation-*
- 配置名称:心理学(Psychology),数据文件路径:Psychology/validation-*
- 配置名称:公共卫生学(Public_Health),数据文件路径:Public_Health/validation-*
- 配置名称:社会学(Sociology),数据文件路径:Sociology/validation-*
提供机构:
HanxuHU
原始信息汇总
数据集概述
本数据集包含多个子数据集,每个子数据集对应不同的学科领域,具体包括会计、农业、建筑与工程、艺术、艺术理论、基础医学科学、生物学、化学、临床医学、计算机科学、设计、诊断与实验室医学、经济学、电子学、能源与电力、金融、地理、历史、文学、管理、市场营销、材料科学、数学、机械工程、音乐和药学。
数据集特征
每个子数据集包含以下特征:
id: 数据标识符,类型为字符串。question: 问题描述,类型为字符串。options: 选项,类型为字符串。explanation: 解释,类型为字符串。image_1至image_7: 图像文件,类型为图像。img_type: 图像类型,类型为字符串。answer: 答案,类型为字符串。topic_difficulty: 主题难度,类型为字符串。question_type: 问题类型,类型为字符串。subfield: 子领域,类型为字符串。
数据集分割
每个子数据集都包含一个名为validation的分割,该分割提供了数据集的大小(以字节为单位)和示例数量。
数据集大小和下载大小
每个子数据集的validation分割提供了数据集的总大小(以字节为单位)和下载大小(以字节为单位)。这些信息有助于用户了解数据集的存储和传输需求。



