OALL/Arabic_MMLU
收藏Hugging Face2024-09-05 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/OALL/Arabic_MMLU
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: abstract_algebra
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 29769
num_examples: 100
- name: dev
num_bytes: 1269
num_examples: 5
download_size: 19750
dataset_size: 31038
- config_name: anatomy
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 48669
num_examples: 135
- name: dev
num_bytes: 1534
num_examples: 5
download_size: 35258
dataset_size: 50203
- config_name: astronomy
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 69704
num_examples: 152
- name: dev
num_bytes: 2981
num_examples: 5
download_size: 49878
dataset_size: 72685
- config_name: business_ethics
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 51514
num_examples: 100
- name: dev
num_bytes: 3288
num_examples: 5
download_size: 37704
dataset_size: 54802
- config_name: clinical_knowledge
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 102346
num_examples: 265
- name: dev
num_bytes: 1810
num_examples: 5
download_size: 63082
dataset_size: 104156
- config_name: college_biology
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 75007
num_examples: 144
- name: dev
num_bytes: 2379
num_examples: 5
download_size: 50193
dataset_size: 77386
- config_name: college_chemistry
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 37276
num_examples: 100
- name: dev
num_bytes: 2083
num_examples: 5
download_size: 31944
dataset_size: 39359
- config_name: college_computer_science
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 56979
num_examples: 100
- name: dev
num_bytes: 3415
num_examples: 5
download_size: 41297
dataset_size: 60394
- config_name: college_mathematics
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 36648
num_examples: 100
- name: dev
num_bytes: 1891
num_examples: 5
download_size: 29831
dataset_size: 38539
- config_name: college_medicine
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 80163
num_examples: 173
- name: dev
num_bytes: 2650
num_examples: 5
download_size: 53862
dataset_size: 82813
- config_name: college_physics
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 42431
num_examples: 102
- name: dev
num_bytes: 1828
num_examples: 5
download_size: 30292
dataset_size: 44259
- config_name: computer_security
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 39166
num_examples: 100
- name: dev
num_bytes: 1750
num_examples: 5
download_size: 31153
dataset_size: 40916
- config_name: conceptual_physics
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 69000
num_examples: 235
- name: dev
num_bytes: 1537
num_examples: 5
download_size: 40421
dataset_size: 70537
- config_name: econometrics
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 63979
num_examples: 114
- name: dev
num_bytes: 2364
num_examples: 5
download_size: 44448
dataset_size: 66343
- config_name: electrical_engineering
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 42482
num_examples: 145
- name: dev
num_bytes: 1680
num_examples: 5
download_size: 31774
dataset_size: 44162
- config_name: elementary_mathematics
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 108603
num_examples: 378
- name: dev
num_bytes: 2078
num_examples: 5
download_size: 61970
dataset_size: 110681
- config_name: formal_logic
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 69054
num_examples: 126
- name: dev
num_bytes: 2558
num_examples: 5
download_size: 43567
dataset_size: 71612
- config_name: global_facts
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 30511
num_examples: 100
- name: dev
num_bytes: 1752
num_examples: 5
download_size: 26776
dataset_size: 32263
- config_name: high_school_biology
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 168964
num_examples: 310
- name: dev
num_bytes: 2865
num_examples: 5
download_size: 90706
dataset_size: 171829
- config_name: high_school_chemistry
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 89575
num_examples: 203
- name: dev
num_bytes: 2145
num_examples: 5
download_size: 52145
dataset_size: 91720
- config_name: high_school_computer_science
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 62039
num_examples: 100
- name: dev
num_bytes: 4358
num_examples: 5
download_size: 46934
dataset_size: 66397
- config_name: high_school_european_history
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 95458
num_examples: 165
- name: dev
num_bytes: 2434
num_examples: 5
download_size: 49160
dataset_size: 97892
- config_name: high_school_geography
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 72427
num_examples: 198
- name: dev
num_bytes: 2184
num_examples: 5
download_size: 44749
dataset_size: 74611
- config_name: high_school_government_and_politics
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 107773
num_examples: 193
- name: dev
num_bytes: 2774
num_examples: 5
download_size: 63285
dataset_size: 110547
- config_name: high_school_macroeconomics
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 196950
num_examples: 390
- name: dev
num_bytes: 2481
num_examples: 5
download_size: 91074
dataset_size: 199431
- config_name: high_school_mathematics
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 83340
num_examples: 270
- name: dev
num_bytes: 2072
num_examples: 5
download_size: 46560
dataset_size: 85412
- config_name: high_school_microeconomics
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 125185
num_examples: 238
- name: dev
num_bytes: 1952
num_examples: 5
download_size: 64821
dataset_size: 127137
- config_name: high_school_physics
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 78332
num_examples: 151
- name: dev
num_bytes: 2221
num_examples: 5
download_size: 46384
dataset_size: 80553
- config_name: high_school_psychology
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 246335
num_examples: 545
- name: dev
num_bytes: 2501
num_examples: 5
download_size: 122056
dataset_size: 248836
- config_name: high_school_statistics
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 148636
num_examples: 216
- name: dev
num_bytes: 3053
num_examples: 5
download_size: 83364
dataset_size: 151689
- config_name: high_school_us_history
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 117928
num_examples: 204
- name: dev
num_bytes: 2353
num_examples: 5
download_size: 45590
dataset_size: 120281
- config_name: high_school_world_history
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 138288
num_examples: 237
- name: dev
num_bytes: 2270
num_examples: 5
download_size: 57174
dataset_size: 140558
- config_name: human_aging
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 74221
num_examples: 223
- name: dev
num_bytes: 1620
num_examples: 5
download_size: 48124
dataset_size: 75841
- config_name: human_sexuality
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 49433
num_examples: 131
- name: dev
num_bytes: 1705
num_examples: 5
download_size: 36031
dataset_size: 51138
- config_name: international_law
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 79679
num_examples: 121
- name: dev
num_bytes: 3626
num_examples: 5
download_size: 58645
dataset_size: 83305
- config_name: jurisprudence
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 46821
num_examples: 108
- name: dev
num_bytes: 1705
num_examples: 5
download_size: 38797
dataset_size: 48526
- config_name: logical_fallacies
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 73002
num_examples: 163
- name: dev
num_bytes: 2225
num_examples: 5
download_size: 45485
dataset_size: 75227
- config_name: machine_learning
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 49230
num_examples: 112
- name: dev
num_bytes: 3443
num_examples: 5
download_size: 40348
dataset_size: 52673
- config_name: management
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 29353
num_examples: 103
- name: dev
num_bytes: 1262
num_examples: 5
download_size: 25701
dataset_size: 30615
- config_name: marketing
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 92377
num_examples: 234
- name: dev
num_bytes: 2487
num_examples: 5
download_size: 58101
dataset_size: 94864
- config_name: medical_genetics
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 33633
num_examples: 100
- name: dev
num_bytes: 2032
num_examples: 5
download_size: 30302
dataset_size: 35665
- config_name: miscellaneous
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 214072
num_examples: 783
- name: dev
num_bytes: 1109
num_examples: 5
download_size: 123867
dataset_size: 215181
- config_name: moral_disputes
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 163324
num_examples: 346
- name: dev
num_bytes: 2599
num_examples: 5
download_size: 92773
dataset_size: 165923
- config_name: moral_scenarios
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 632998
num_examples: 895
- name: dev
num_bytes: 3372
num_examples: 5
download_size: 167360
dataset_size: 636370
- config_name: nutrition
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 143862
num_examples: 306
- name: dev
num_bytes: 3217
num_examples: 5
download_size: 86988
dataset_size: 147079
- config_name: philosophy
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 112934
num_examples: 311
- name: dev
num_bytes: 1375
num_examples: 5
download_size: 67743
dataset_size: 114309
- config_name: prehistory
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 136174
num_examples: 324
- name: dev
num_bytes: 2840
num_examples: 5
download_size: 82678
dataset_size: 139014
- config_name: professional_accounting
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 178116
num_examples: 282
- name: dev
num_bytes: 2765
num_examples: 5
download_size: 98823
dataset_size: 180881
- config_name: professional_law
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 1771393
num_examples: 1534
- name: dev
num_bytes: 6926
num_examples: 5
download_size: 833880
dataset_size: 1778319
- config_name: professional_medicine
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 166458
num_examples: 272
- name: dev
num_bytes: 2964
num_examples: 5
download_size: 78692
dataset_size: 169422
- config_name: professional_psychology
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 313950
num_examples: 612
- name: dev
num_bytes: 3183
num_examples: 5
download_size: 167005
dataset_size: 317133
- config_name: public_relations
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 42175
num_examples: 110
- name: dev
num_bytes: 2266
num_examples: 5
download_size: 34096
dataset_size: 44441
- config_name: security_studies
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 288908
num_examples: 245
- name: dev
num_bytes: 7190
num_examples: 5
download_size: 162137
dataset_size: 296098
- config_name: sociology
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 97962
num_examples: 201
- name: dev
num_bytes: 2490
num_examples: 5
download_size: 62735
dataset_size: 100452
- config_name: us_foreign_policy
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 47525
num_examples: 100
- name: dev
num_bytes: 2725
num_examples: 5
download_size: 35472
dataset_size: 50250
- config_name: virology
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 63377
num_examples: 166
- name: dev
num_bytes: 1777
num_examples: 5
download_size: 42481
dataset_size: 65154
- config_name: world_religions
features:
- name: question
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: answer
dtype: string
- name: subject
dtype: string
splits:
- name: test
num_bytes: 40435
num_examples: 171
- name: dev
num_bytes: 1000
num_examples: 5
download_size: 28872
dataset_size: 41435
configs:
- config_name: abstract_algebra
data_files:
- split: test
path: abstract_algebra/test-*
- split: dev
path: abstract_algebra/dev-*
- config_name: anatomy
data_files:
- split: test
path: anatomy/test-*
- split: dev
path: anatomy/dev-*
- config_name: astronomy
data_files:
- split: test
path: astronomy/test-*
- split: dev
path: astronomy/dev-*
- config_name: business_ethics
data_files:
- split: test
path: business_ethics/test-*
- split: dev
path: business_ethics/dev-*
- config_name: clinical_knowledge
data_files:
- split: test
path: clinical_knowledge/test-*
- split: dev
path: clinical_knowledge/dev-*
- config_name: college_biology
data_files:
- split: test
path: college_biology/test-*
- split: dev
path: college_biology/dev-*
- config_name: college_chemistry
data_files:
- split: test
path: college_chemistry/test-*
- split: dev
path: college_chemistry/dev-*
- config_name: college_computer_science
data_files:
- split: test
path: college_computer_science/test-*
- split: dev
path: college_computer_science/dev-*
- config_name: college_mathematics
data_files:
- split: test
path: college_mathematics/test-*
- split: dev
path: college_mathematics/dev-*
- config_name: college_medicine
data_files:
- split: test
path: college_medicine/test-*
- split: dev
path: college_medicine/dev-*
- config_name: college_physics
data_files:
- split: test
path: college_physics/test-*
- split: dev
path: college_physics/dev-*
- config_name: computer_security
data_files:
- split: test
path: computer_security/test-*
- split: dev
path: computer_security/dev-*
- config_name: conceptual_physics
data_files:
- split: test
path: conceptual_physics/test-*
- split: dev
path: conceptual_physics/dev-*
- config_name: econometrics
data_files:
- split: test
path: econometrics/test-*
- split: dev
path: econometrics/dev-*
- config_name: electrical_engineering
data_files:
- split: test
path: electrical_engineering/test-*
- split: dev
path: electrical_engineering/dev-*
- config_name: elementary_mathematics
data_files:
- split: test
path: elementary_mathematics/test-*
- split: dev
path: elementary_mathematics/dev-*
- config_name: formal_logic
data_files:
- split: test
path: formal_logic/test-*
- split: dev
path: formal_logic/dev-*
- config_name: global_facts
data_files:
- split: test
path: global_facts/test-*
- split: dev
path: global_facts/dev-*
- config_name: high_school_biology
data_files:
- split: test
path: high_school_biology/test-*
- split: dev
path: high_school_biology/dev-*
- config_name: high_school_chemistry
data_files:
- split: test
path: high_school_chemistry/test-*
- split: dev
path: high_school_chemistry/dev-*
- config_name: high_school_computer_science
data_files:
- split: test
path: high_school_computer_science/test-*
- split: dev
path: high_school_computer_science/dev-*
- config_name: high_school_european_history
data_files:
- split: test
path: high_school_european_history/test-*
- split: dev
path: high_school_european_history/dev-*
- config_name: high_school_geography
data_files:
- split: test
path: high_school_geography/test-*
- split: dev
path: high_school_geography/dev-*
- config_name: high_school_government_and_politics
data_files:
- split: test
path: high_school_government_and_politics/test-*
- split: dev
path: high_school_government_and_politics/dev-*
- config_name: high_school_macroeconomics
data_files:
- split: test
path: high_school_macroeconomics/test-*
- split: dev
path: high_school_macroeconomics/dev-*
- config_name: high_school_mathematics
data_files:
- split: test
path: high_school_mathematics/test-*
- split: dev
path: high_school_mathematics/dev-*
- config_name: high_school_microeconomics
data_files:
- split: test
path: high_school_microeconomics/test-*
- split: dev
path: high_school_microeconomics/dev-*
- config_name: high_school_physics
data_files:
- split: test
path: high_school_physics/test-*
- split: dev
path: high_school_physics/dev-*
- config_name: high_school_psychology
data_files:
- split: test
path: high_school_psychology/test-*
- split: dev
path: high_school_psychology/dev-*
- config_name: high_school_statistics
data_files:
- split: test
path: high_school_statistics/test-*
- split: dev
path: high_school_statistics/dev-*
- config_name: high_school_us_history
data_files:
- split: test
path: high_school_us_history/test-*
- split: dev
path: high_school_us_history/dev-*
- config_name: high_school_world_history
data_files:
- split: test
path: high_school_world_history/test-*
- split: dev
path: high_school_world_history/dev-*
- config_name: human_aging
data_files:
- split: test
path: human_aging/test-*
- split: dev
path: human_aging/dev-*
- config_name: human_sexuality
data_files:
- split: test
path: human_sexuality/test-*
- split: dev
path: human_sexuality/dev-*
- config_name: international_law
data_files:
- split: test
path: international_law/test-*
- split: dev
path: international_law/dev-*
- config_name: jurisprudence
data_files:
- split: test
path: jurisprudence/test-*
- split: dev
path: jurisprudence/dev-*
- config_name: logical_fallacies
data_files:
- split: test
path: logical_fallacies/test-*
- split: dev
path: logical_fallacies/dev-*
- config_name: machine_learning
data_files:
- split: test
path: machine_learning/test-*
- split: dev
path: machine_learning/dev-*
- config_name: management
data_files:
- split: test
path: management/test-*
- split: dev
path: management/dev-*
- config_name: marketing
data_files:
- split: test
path: marketing/test-*
- split: dev
path: marketing/dev-*
- config_name: medical_genetics
data_files:
- split: test
path: medical_genetics/test-*
- split: dev
path: medical_genetics/dev-*
- config_name: miscellaneous
data_files:
- split: test
path: miscellaneous/test-*
- split: dev
path: miscellaneous/dev-*
- config_name: moral_disputes
data_files:
- split: test
path: moral_disputes/test-*
- split: dev
path: moral_disputes/dev-*
- config_name: moral_scenarios
data_files:
- split: test
path: moral_scenarios/test-*
- split: dev
path: moral_scenarios/dev-*
- config_name: nutrition
data_files:
- split: test
path: nutrition/test-*
- split: dev
path: nutrition/dev-*
- config_name: philosophy
data_files:
- split: test
path: philosophy/test-*
- split: dev
path: philosophy/dev-*
- config_name: prehistory
data_files:
- split: test
path: prehistory/test-*
- split: dev
path: prehistory/dev-*
- config_name: professional_accounting
data_files:
- split: test
path: professional_accounting/test-*
- split: dev
path: professional_accounting/dev-*
- config_name: professional_law
data_files:
- split: test
path: professional_law/test-*
- split: dev
path: professional_law/dev-*
- config_name: professional_medicine
data_files:
- split: test
path: professional_medicine/test-*
- split: dev
path: professional_medicine/dev-*
- config_name: professional_psychology
data_files:
- split: test
path: professional_psychology/test-*
- split: dev
path: professional_psychology/dev-*
- config_name: public_relations
data_files:
- split: test
path: public_relations/test-*
- split: dev
path: public_relations/dev-*
- config_name: security_studies
data_files:
- split: test
path: security_studies/test-*
- split: dev
path: security_studies/dev-*
- config_name: sociology
data_files:
- split: test
path: sociology/test-*
- split: dev
path: sociology/dev-*
- config_name: us_foreign_policy
data_files:
- split: test
path: us_foreign_policy/test-*
- split: dev
path: us_foreign_policy/dev-*
- config_name: virology
data_files:
- split: test
path: virology/test-*
- split: dev
path: virology/dev-*
- config_name: world_religions
data_files:
- split: test
path: world_religions/test-*
- split: dev
path: world_religions/dev-*
---
This dataset belongs to [FreedomIntelligence](https://huggingface.co/FreedomIntelligence) and the original version can be found here : https://github.com/FreedomIntelligence/AceGPT/tree/main/eval/benchmark_eval/benchmarks/MMLUArabic
提供机构:
OALL
原始信息汇总
数据集概述
该数据集包含多个子集,每个子集对应一个特定的学科领域。每个子集包含以下特征:
- question: 问题,数据类型为字符串。
- A: 选项A,数据类型为字符串。
- B: 选项B,数据类型为字符串。
- C: 选项C,数据类型为字符串。
- D: 选项D,数据类型为字符串。
- answer: 答案,数据类型为字符串。
- subject: 学科,数据类型为字符串。
每个子集分为两个数据分割:
- test: 测试集,包含一定数量的字节和样本。
- dev: 开发集,包含一定数量的字节和样本。
以下是各子集的详细信息:
学科子集列表
抽象代数 (abstract_algebra)
- test: 29769 字节, 100 样本
- dev: 1269 字节, 5 样本
- 下载大小: 19750 字节
- 数据集大小: 31038 字节
解剖学 (anatomy)
- test: 48669 字节, 135 样本
- dev: 1534 字节, 5 样本
- 下载大小: 35258 字节
- 数据集大小: 50203 字节
天文学 (astronomy)
- test: 69704 字节, 152 样本
- dev: 2981 字节, 5 样本
- 下载大小: 49878 字节
- 数据集大小: 72685 字节
商业伦理 (business_ethics)
- test: 51514 字节, 100 样本
- dev: 3288 字节, 5 样本
- 下载大小: 37704 字节
- 数据集大小: 54802 字节
临床知识 (clinical_knowledge)
- test: 102346 字节, 265 样本
- dev: 1810 字节, 5 样本
- 下载大小: 63082 字节
- 数据集大小: 104156 字节
大学生物学 (college_biology)
- test: 75007 字节, 144 样本
- dev: 2379 字节, 5 样本
- 下载大小: 50193 字节
- 数据集大小: 77386 字节
大学化学 (college_chemistry)
- test: 37276 字节, 100 样本
- dev: 2083 字节, 5 样本
- 下载大小: 31944 字节
- 数据集大小: 39359 字节
大学计算机科学 (college_computer_science)
- test: 56979 字节, 100 样本
- dev: 3415 字节, 5 样本
- 下载大小: 41297 字节
- 数据集大小: 60394 字节
大学数学 (college_mathematics)
- test: 36648 字节, 100 样本
- dev: 1891 字节, 5 样本
- 下载大小: 29831 字节
- 数据集大小: 38539 字节
大学医学 (college_medicine)
- test: 80163 字节, 173 样本
- dev: 2650 字节, 5 样本
- 下载大小: 53862 字节
- 数据集大小: 82813 字节
大学物理 (college_physics)
- test: 42431 字节, 102 样本
- dev: 1828 字节, 5 样本
- 下载大小: 30292 字节
- 数据集大小: 44259 字节
计算机安全 (computer_security)
- test: 39166 字节, 100 样本
- dev: 1750 字节, 5 样本
- 下载大小: 31153 字节
- 数据集大小: 40916 字节
概念物理 (conceptual_physics)
- test: 69000 字节, 235 样本
- dev: 1537 字节, 5 样本
- 下载大小: 40421 字节
- 数据集大小: 70537 字节
计量经济学 (econometrics)
- test: 63979 字节, 114 样本
- dev: 2364 字节, 5 样本
- 下载大小: 44448 字节
- 数据集大小: 66343 字节
电气工程 (electrical_engineering)
- test: 42482 字节, 145 样本
- dev: 1680 字节, 5 样本
- 下载大小: 31774 字节
- 数据集大小: 44162 字节
初等数学 (elementary_mathematics)
- test: 108603 字节, 378 样本
- dev: 2078 字节, 5 样本
- 下载大小: 61970 字节
- 数据集大小: 110681 字节
形式逻辑 (formal_logic)
- test: 69054 字节, 126 样本
- dev: 2558 字节, 5 样本
- 下载大小: 43567 字节
- 数据集大小: 71612 字节
全球事实 (global_facts)
- test: 30511 字节, 100 样本
- dev: 1752 字节, 5 样本
- 下载大小: 26776 字节
- 数据集大小: 32263 字节
高中生物学 (high_school_biology)
- test: 168964 字节, 310 样本
- dev: 2865 字节, 5 样本
- 下载大小: 90706 字节
- 数据集大小: 171829 字节
高中化学 (high_school_chemistry)
- test: 89575 字节, 203 样本
- dev: 2145 字节, 5 样本
- 下载大小: 52145 字节
- 数据集大小: 91720 字节
高中计算机科学 (high_school_computer_science)
- test: 62039 字节, 100 样本
- dev: 4358 字节, 5 样本
- 下载大小: 46934 字节
- 数据集大小: 66397 字节
高中欧洲历史 (high_school_european_history)
- test: 95458 字节, 165 样本
- dev: 2434 字节, 5 样本
- 下载大小: 49160 字节
- 数据集大小: 97892 字节
高中地理 (high_school_geography)
- test: 72427 字节, 198 样本
- dev: 2184 字节, 5 样本
- 下载大小: 44749 字节
- 数据集大小: 74611 字节
高中政府与政治 (high_school_government_and_politics)
- test: 107773 字节, 193 样本
- dev: 2774 字节, 5 样本
- 下载大小: 63285 字节
- 数据集大小: 110547 字节
高中宏观经济学 (high_school_macroeconomics)
- test: 196950 字节, 390 样本
- dev: 2481 字节, 5 样本
- 下载大小: 91074 字节
- 数据集大小: 199431 字节
高中数学 (high_school_mathematics)
- test: 83340 字节, 270 样本
- dev: 2072 字节, 5 样本
- 下载大小: 46560 字节
- 数据集大小: 85412 字节
高中微观经济学 (high_school_microeconomics)
- test: 125185 字节, 238 样本
- dev: 1952 字节, 5 样本
- 下载大小: 64821 字节
- 数据集大小: 127137 字节
高中物理 (high_school_physics)
- test: 78332 字节, 151 样本
- dev: 2221 字节, 5 样本
- 下载大小: 46384 字节
- 数据集大小: 80553 字节
高中心理学 (high_school_psychology)
- test: 246335 字节, 545 样本
- dev: 2501 字节, 5 样本
- 下载大小: 122056 字节
- 数据集大小: 248836 字节
高中统计学 (high_school_statistics)
- test: 148636 字节, 216 样本
- dev: 3053 字节, 5 样本
- 下载大小: 83364 字节
- 数据集大小: 151689 字



