
malhajar/mmlu-tr

Source: Hugging Face. Updated 2024-03-05. Indexed 2024-06-22.
Download link:
https://hf-mirror.com/datasets/malhajar/mmlu-tr
Resource description:
--- language: - tr license: apache-2.0 task_categories: - text-classification - multiple-choice - question-answering task_ids: - multiple-choice-qa - open-domain-qa - closed-domain-qa tags: - multi-task - multitask - mmlu - hendrycks_test dataset_info: - config_name: abstract_algebra features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1213 num_examples: 4 - name: test num_bytes: 30380 num_examples: 99 - name: validation num_bytes: 2990 num_examples: 10 - config_name: anatomy features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1023 num_examples: 4 - name: test num_bytes: 44968 num_examples: 134 - name: validation num_bytes: 4074 num_examples: 13 - config_name: astronomy features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 2775 num_examples: 4 - name: test num_bytes: 72243 num_examples: 151 - name: validation num_bytes: 6884 num_examples: 15 - config_name: business_ethics features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 2754 num_examples: 4 - name: test num_bytes: 47509 num_examples: 99 - name: validation num_bytes: 4131 num_examples: 10 - config_name: clinical_knowledge features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1615 num_examples: 4 - name: test num_bytes: 92165 num_examples: 264 - name: validation num_bytes: 9846 num_examples: 28 - config_name: college_biology features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1514 num_examples: 4 - name: test num_bytes: 70502 num_examples: 143 - name: validation num_bytes: 7086 num_examples: 15 - config_name: college_chemistry features: - 
name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1350 num_examples: 4 - name: test num_bytes: 35099 num_examples: 99 - name: validation num_bytes: 2807 num_examples: 7 - config_name: college_computer_science features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 3582 num_examples: 4 - name: test num_bytes: 64366 num_examples: 99 - name: validation num_bytes: 6475 num_examples: 10 - config_name: college_mathematics features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1696 num_examples: 4 - name: test num_bytes: 35750 num_examples: 99 - name: validation num_bytes: 3410 num_examples: 10 - config_name: college_medicine features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 2090 num_examples: 4 - name: test num_bytes: 119254 num_examples: 172 - name: validation num_bytes: 10820 num_examples: 21 - config_name: college_physics features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1504 num_examples: 4 - name: test num_bytes: 41574 num_examples: 101 - name: validation num_bytes: 4353 num_examples: 10 - config_name: computer_security features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1681 num_examples: 4 - name: test num_bytes: 43455 num_examples: 99 - name: validation num_bytes: 6697 num_examples: 10 - config_name: conceptual_physics features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1210 num_examples: 4 - name: test num_bytes: 63735 num_examples: 234 - name: validation num_bytes: 6752 num_examples: 25 - config_name: econometrics 
features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1997 num_examples: 4 - name: test num_bytes: 65356 num_examples: 113 - name: validation num_bytes: 6793 num_examples: 11 - config_name: electrical_engineering features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1207 num_examples: 4 - name: test num_bytes: 38344 num_examples: 144 - name: validation num_bytes: 4115 num_examples: 15 - config_name: elementary_mathematics features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1728 num_examples: 4 - name: test num_bytes: 100660 num_examples: 377 - name: validation num_bytes: 12903 num_examples: 40 - config_name: formal_logic features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 2365 num_examples: 4 - name: test num_bytes: 73028 num_examples: 125 - name: validation num_bytes: 8768 num_examples: 13 - config_name: global_facts features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1118 num_examples: 4 - name: test num_bytes: 29486 num_examples: 99 - name: validation num_bytes: 2736 num_examples: 9 - config_name: high_school_biology features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 2162 num_examples: 4 - name: test num_bytes: 156715 num_examples: 309 - name: validation num_bytes: 14527 num_examples: 31 - config_name: high_school_chemistry features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1656 num_examples: 4 - name: test num_bytes: 82374 num_examples: 202 - name: validation num_bytes: 9753 num_examples: 21 - config_name: 
high_school_computer_science features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 3770 num_examples: 4 - name: test num_bytes: 67680 num_examples: 99 - name: validation num_bytes: 4744 num_examples: 8 - config_name: high_school_european_history features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 13380 num_examples: 4 - name: test num_bytes: 379904 num_examples: 164 - name: validation num_bytes: 38640 num_examples: 17 - config_name: high_school_geography features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1903 num_examples: 4 - name: test num_bytes: 64542 num_examples: 197 - name: validation num_bytes: 6151 num_examples: 21 - config_name: high_school_government_and_politics features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1931 num_examples: 4 - name: test num_bytes: 98507 num_examples: 192 - name: validation num_bytes: 9710 num_examples: 20 - config_name: high_school_macroeconomics features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1568 num_examples: 4 - name: test num_bytes: 175522 num_examples: 389 - name: validation num_bytes: 18938 num_examples: 42 - config_name: high_school_mathematics features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1183 num_examples: 4 - name: test num_bytes: 76921 num_examples: 269 - name: validation num_bytes: 7961 num_examples: 28 - config_name: high_school_microeconomics features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1197 num_examples: 4 - name: test num_bytes: 110403 
num_examples: 237 - name: validation num_bytes: 10736 num_examples: 25 - config_name: high_school_physics features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1789 num_examples: 4 - name: test num_bytes: 84860 num_examples: 150 - name: validation num_bytes: 8807 num_examples: 16 - config_name: high_school_psychology features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 2191 num_examples: 4 - name: test num_bytes: 237454 num_examples: 544 - name: validation num_bytes: 25261 num_examples: 59 - config_name: high_school_statistics features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 2829 num_examples: 4 - name: test num_bytes: 160308 num_examples: 215 - name: validation num_bytes: 14465 num_examples: 22 - config_name: high_school_us_history features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 11136 num_examples: 4 - name: test num_bytes: 427246 num_examples: 203 - name: validation num_bytes: 44180 num_examples: 21 - config_name: high_school_world_history features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 6339 num_examples: 4 - name: test num_bytes: 544262 num_examples: 236 - name: validation num_bytes: 63826 num_examples: 25 - config_name: human_aging features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1301 num_examples: 4 - name: test num_bytes: 72894 num_examples: 222 - name: validation num_bytes: 7047 num_examples: 22 - config_name: human_sexuality features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1286 
num_examples: 4 - name: test num_bytes: 46845 num_examples: 130 - name: validation num_bytes: 3231 num_examples: 11 - config_name: international_law features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 2841 num_examples: 4 - name: test num_bytes: 78414 num_examples: 120 - name: validation num_bytes: 8742 num_examples: 12 - config_name: jurisprudence features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1336 num_examples: 4 - name: test num_bytes: 49177 num_examples: 107 - name: validation num_bytes: 5453 num_examples: 10 - config_name: logical_fallacies features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1958 num_examples: 4 - name: test num_bytes: 76985 num_examples: 162 - name: validation num_bytes: 7516 num_examples: 17 - config_name: machine_learning features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 3179 num_examples: 4 - name: test num_bytes: 54414 num_examples: 111 - name: validation num_bytes: 4357 num_examples: 10 - config_name: management features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 870 num_examples: 4 - name: test num_bytes: 29869 num_examples: 102 - name: validation num_bytes: 2530 num_examples: 10 - config_name: marketing features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1977 num_examples: 4 - name: test num_bytes: 95368 num_examples: 233 - name: validation num_bytes: 10670 num_examples: 24 - config_name: medical_genetics features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1297 
num_examples: 4 - name: test num_bytes: 29741 num_examples: 99 - name: validation num_bytes: 3815 num_examples: 10 - config_name: miscellaneous features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 874 num_examples: 4 - name: test num_bytes: 223389 num_examples: 782 - name: validation num_bytes: 21001 num_examples: 85 - config_name: moral_disputes features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1842 num_examples: 4 - name: test num_bytes: 165916 num_examples: 345 - name: validation num_bytes: 18415 num_examples: 37 - config_name: moral_scenarios features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 2582 num_examples: 4 - name: test num_bytes: 614251 num_examples: 894 - name: validation num_bytes: 68302 num_examples: 99 - config_name: nutrition features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 2212 num_examples: 4 - name: test num_bytes: 135605 num_examples: 305 - name: validation num_bytes: 11919 num_examples: 32 - config_name: philosophy features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 997 num_examples: 4 - name: test num_bytes: 121539 num_examples: 310 - name: validation num_bytes: 12763 num_examples: 33 - config_name: prehistory features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 2269 num_examples: 4 - name: test num_bytes: 132441 num_examples: 323 - name: validation num_bytes: 15041 num_examples: 34 - config_name: professional_accounting features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 2409 
num_examples: 4 - name: test num_bytes: 178410 num_examples: 281 - name: validation num_bytes: 20331 num_examples: 30 - config_name: professional_law features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 7449 num_examples: 4 - name: test num_bytes: 2730513 num_examples: 1533 - name: validation num_bytes: 294872 num_examples: 169 - config_name: professional_medicine features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 3669 num_examples: 4 - name: test num_bytes: 298852 num_examples: 271 - name: validation num_bytes: 31340 num_examples: 30 - config_name: professional_psychology features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1936 num_examples: 4 - name: test num_bytes: 337821 num_examples: 611 - name: validation num_bytes: 43121 num_examples: 68 - config_name: public_relations features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1592 num_examples: 4 - name: test num_bytes: 42078 num_examples: 109 - name: validation num_bytes: 6406 num_examples: 11 - config_name: security_studies features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 5725 num_examples: 4 - name: test num_bytes: 307394 num_examples: 244 - name: validation num_bytes: 32839 num_examples: 26 - config_name: sociology features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 2078 num_examples: 4 - name: test num_bytes: 100739 num_examples: 200 - name: validation num_bytes: 10419 num_examples: 21 - config_name: us_foreign_policy features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - 
name: dev num_bytes: 2098 num_examples: 4 - name: test num_bytes: 41654 num_examples: 99 - name: validation num_bytes: 4116 num_examples: 10 - config_name: virology features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 1305 num_examples: 4 - name: test num_bytes: 59351 num_examples: 165 - name: validation num_bytes: 8059 num_examples: 17 - config_name: world_religions features: - name: question dtype: string - name: choices sequence: string - name: answer dtype: int64 splits: - name: dev num_bytes: 737 num_examples: 4 - name: test num_bytes: 35616 num_examples: 170 - name: validation num_bytes: 3704 num_examples: 18 configs: - config_name: abstract_algebra data_files: - split: dev path: abstract_algebra/dev-* - split: test path: abstract_algebra/test-* - split: validation path: abstract_algebra/validation-* - config_name: anatomy data_files: - split: dev path: anatomy/dev-* - split: test path: anatomy/test-* - split: validation path: anatomy/validation-* - config_name: astronomy data_files: - split: dev path: astronomy/dev-* - split: test path: astronomy/test-* - split: validation path: astronomy/validation-* - config_name: business_ethics data_files: - split: dev path: business_ethics/dev-* - split: test path: business_ethics/test-* - split: validation path: business_ethics/validation-* - config_name: clinical_knowledge data_files: - split: dev path: clinical_knowledge/dev-* - split: test path: clinical_knowledge/test-* - split: validation path: clinical_knowledge/validation-* - config_name: college_biology data_files: - split: dev path: college_biology/dev-* - split: test path: college_biology/test-* - split: validation path: college_biology/validation-* - config_name: college_chemistry data_files: - split: dev path: college_chemistry/dev-* - split: test path: college_chemistry/test-* - split: validation path: college_chemistry/validation-* - config_name: college_computer_science 
data_files: - split: dev path: college_computer_science/dev-* - split: test path: college_computer_science/test-* - split: validation path: college_computer_science/validation-* - config_name: college_mathematics data_files: - split: dev path: college_mathematics/dev-* - split: test path: college_mathematics/test-* - split: validation path: college_mathematics/validation-* - config_name: college_medicine data_files: - split: dev path: college_medicine/dev-* - split: test path: college_medicine/test-* - split: validation path: college_medicine/validation-* - config_name: college_physics data_files: - split: dev path: college_physics/dev-* - split: test path: college_physics/test-* - split: validation path: college_physics/validation-* - config_name: computer_security data_files: - split: dev path: computer_security/dev-* - split: test path: computer_security/test-* - split: validation path: computer_security/validation-* - config_name: conceptual_physics data_files: - split: dev path: conceptual_physics/dev-* - split: test path: conceptual_physics/test-* - split: validation path: conceptual_physics/validation-* - config_name: econometrics data_files: - split: dev path: econometrics/dev-* - split: test path: econometrics/test-* - split: validation path: econometrics/validation-* - config_name: electrical_engineering data_files: - split: dev path: electrical_engineering/dev-* - split: test path: electrical_engineering/test-* - split: validation path: electrical_engineering/validation-* - config_name: elementary_mathematics data_files: - split: dev path: elementary_mathematics/dev-* - split: test path: elementary_mathematics/test-* - split: validation path: elementary_mathematics/validation-* - config_name: formal_logic data_files: - split: dev path: formal_logic/dev-* - split: test path: formal_logic/test-* - split: validation path: formal_logic/validation-* - config_name: global_facts data_files: - split: dev path: global_facts/dev-* - split: test path: 
global_facts/test-* - split: validation path: global_facts/validation-* - config_name: high_school_biology data_files: - split: dev path: high_school_biology/dev-* - split: test path: high_school_biology/test-* - split: validation path: high_school_biology/validation-* - config_name: high_school_chemistry data_files: - split: dev path: high_school_chemistry/dev-* - split: test path: high_school_chemistry/test-* - split: validation path: high_school_chemistry/validation-* - config_name: high_school_computer_science data_files: - split: dev path: high_school_computer_science/dev-* - split: test path: high_school_computer_science/test-* - split: validation path: high_school_computer_science/validation-* - config_name: high_school_european_history data_files: - split: dev path: high_school_european_history/dev-* - split: test path: high_school_european_history/test-* - split: validation path: high_school_european_history/validation-* - config_name: high_school_geography data_files: - split: dev path: high_school_geography/dev-* - split: test path: high_school_geography/test-* - split: validation path: high_school_geography/validation-* - config_name: high_school_government_and_politics data_files: - split: dev path: high_school_government_and_politics/dev-* - split: test path: high_school_government_and_politics/test-* - split: validation path: high_school_government_and_politics/validation-* - config_name: high_school_macroeconomics data_files: - split: dev path: high_school_macroeconomics/dev-* - split: test path: high_school_macroeconomics/test-* - split: validation path: high_school_macroeconomics/validation-* - config_name: high_school_mathematics data_files: - split: dev path: high_school_mathematics/dev-* - split: test path: high_school_mathematics/test-* - split: validation path: high_school_mathematics/validation-* - config_name: high_school_microeconomics data_files: - split: dev path: high_school_microeconomics/dev-* - split: test path: 
high_school_microeconomics/test-* - split: validation path: high_school_microeconomics/validation-* - config_name: high_school_physics data_files: - split: dev path: high_school_physics/dev-* - split: test path: high_school_physics/test-* - split: validation path: high_school_physics/validation-* - config_name: high_school_psychology data_files: - split: dev path: high_school_psychology/dev-* - split: test path: high_school_psychology/test-* - split: validation path: high_school_psychology/validation-* - config_name: high_school_statistics data_files: - split: dev path: high_school_statistics/dev-* - split: test path: high_school_statistics/test-* - split: validation path: high_school_statistics/validation-* - config_name: high_school_us_history data_files: - split: dev path: high_school_us_history/dev-* - split: test path: high_school_us_history/test-* - split: validation path: high_school_us_history/validation-* - config_name: high_school_world_history data_files: - split: dev path: high_school_world_history/dev-* - split: test path: high_school_world_history/test-* - split: validation path: high_school_world_history/validation-* - config_name: human_aging data_files: - split: dev path: human_aging/dev-* - split: test path: human_aging/test-* - split: validation path: human_aging/validation-* - config_name: human_sexuality data_files: - split: dev path: human_sexuality/dev-* - split: test path: human_sexuality/test-* - split: validation path: human_sexuality/validation-* - config_name: international_law data_files: - split: dev path: international_law/dev-* - split: test path: international_law/test-* - split: validation path: international_law/validation-* - config_name: jurisprudence data_files: - split: dev path: jurisprudence/dev-* - split: test path: jurisprudence/test-* - split: validation path: jurisprudence/validation-* - config_name: logical_fallacies data_files: - split: dev path: logical_fallacies/dev-* - split: test path: logical_fallacies/test-* - 
split: validation path: logical_fallacies/validation-* - config_name: machine_learning data_files: - split: dev path: machine_learning/dev-* - split: test path: machine_learning/test-* - split: validation path: machine_learning/validation-* - config_name: management data_files: - split: dev path: management/dev-* - split: test path: management/test-* - split: validation path: management/validation-* - config_name: marketing data_files: - split: dev path: marketing/dev-* - split: test path: marketing/test-* - split: validation path: marketing/validation-* - config_name: medical_genetics data_files: - split: dev path: medical_genetics/dev-* - split: test path: medical_genetics/test-* - split: validation path: medical_genetics/validation-* - config_name: miscellaneous data_files: - split: dev path: miscellaneous/dev-* - split: test path: miscellaneous/test-* - split: validation path: miscellaneous/validation-* - config_name: moral_disputes data_files: - split: dev path: moral_disputes/dev-* - split: test path: moral_disputes/test-* - split: validation path: moral_disputes/validation-* - config_name: moral_scenarios data_files: - split: dev path: moral_scenarios/dev-* - split: test path: moral_scenarios/test-* - split: validation path: moral_scenarios/validation-* - config_name: nutrition data_files: - split: dev path: nutrition/dev-* - split: test path: nutrition/test-* - split: validation path: nutrition/validation-* - config_name: philosophy data_files: - split: dev path: philosophy/dev-* - split: test path: philosophy/test-* - split: validation path: philosophy/validation-* - config_name: prehistory data_files: - split: dev path: prehistory/dev-* - split: test path: prehistory/test-* - split: validation path: prehistory/validation-* - config_name: professional_accounting data_files: - split: dev path: professional_accounting/dev-* - split: test path: professional_accounting/test-* - split: validation path: professional_accounting/validation-* - config_name: 
professional_law data_files: - split: dev path: professional_law/dev-* - split: test path: professional_law/test-* - split: validation path: professional_law/validation-* - config_name: professional_medicine data_files: - split: dev path: professional_medicine/dev-* - split: test path: professional_medicine/test-* - split: validation path: professional_medicine/validation-* - config_name: professional_psychology data_files: - split: dev path: professional_psychology/dev-* - split: test path: professional_psychology/test-* - split: validation path: professional_psychology/validation-* - config_name: public_relations data_files: - split: dev path: public_relations/dev-* - split: test path: public_relations/test-* - split: validation path: public_relations/validation-* - config_name: security_studies data_files: - split: dev path: security_studies/dev-* - split: test path: security_studies/test-* - split: validation path: security_studies/validation-* - config_name: sociology data_files: - split: dev path: sociology/dev-* - split: test path: sociology/test-* - split: validation path: sociology/validation-* - config_name: us_foreign_policy data_files: - split: dev path: us_foreign_policy/dev-* - split: test path: us_foreign_policy/test-* - split: validation path: us_foreign_policy/validation-* - config_name: virology data_files: - split: dev path: virology/dev-* - split: test path: virology/test-* - split: validation path: virology/validation-* - config_name: world_religions data_files: - split: dev path: world_religions/dev-* - split: test path: world_religions/test-* - split: validation path: world_religions/validation-*
---

This dataset is part of a series of datasets aimed at advancing Turkish LLM development by establishing rigorous Turkish benchmarks for evaluating the performance of LLMs produced in the Turkish language.

# Dataset Card for mmlu-tr

malhajar/mmlu-tr is a translated version of [`mmlu`](https://huggingface.co/datasets/tasksource/mmlu) intended specifically for use in the [`OpenLLMTurkishLeaderboard`](https://huggingface.co/spaces/malhajar/OpenLLMTurkishLeaderboard).

MMLU (`hendrycks_test` on Hugging Face) without the auxiliary train split. It is much lighter (7 MB vs. 162 MB) and faster than the original implementation, which loads (and duplicates) the auxiliary train split by default for every config, making it quite heavy.

Reference to the original dataset: Measuring Massive Multitask Language Understanding - https://github.com/hendrycks/test

## Dataset Description

- **Paper:** [Measuring Massive Multitask Language Understanding](https://arxiv.org/abs/2009.03300)
- **Leaderboard:** [OpenLLMTurkishLeaderboard](https://huggingface.co/spaces/malhajar/OpenLLMTurkishLeaderboard)

### Supported Tasks and Leaderboards

This dataset is defined specifically for use in the [`OpenLLMTurkishLeaderboard`](https://huggingface.co/spaces/malhajar/OpenLLMTurkishLeaderboard).

### Languages

The text in the dataset is in Turkish.

### Contributions

This dataset was translated by [`Mohamad Alhajar`](https://www.linkedin.com/in/muhammet-alhajar/).

```
@article{hendryckstest2021,
  title={Measuring Massive Multitask Language Understanding},
  author={Dan Hendrycks and Collin Burns and Steven Basart and Andy Zou and Mantas Mazeika and Dawn Song and Jacob Steinhardt},
  journal={Proceedings of the International Conference on Learning Representations (ICLR)},
  year={2021}
}
```
Provider:
malhajar
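Summary of original information

Dataset overview

The dataset contains multiple configurations, each corresponding to a different subject area and covering knowledge from high school through college to the professional level. Each configuration has the following features:

  • question: the question, of type string.
  • choices: the answer options, a sequence of strings.
  • answer: the answer, of type integer (int64).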
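
To illustrate how these three fields fit together, the sketch below renders one record as a lettered multiple-choice prompt. The sample record is invented for illustration and is not taken from the dataset. The code assumes that `answer` is a zero-based index into `choices`, as in the original MMLU; `format_question` and `answer_letter` are hypothetical helper names.

```python
# Assumption: `answer` is a zero-based index into `choices`.
LETTERS = "ABCD"


def format_question(record: dict) -> str:
    """Render a record as the question followed by lettered options."""
    lines = [record["question"]]
    for letter, choice in zip(LETTERS, record["choices"]):
        lines.append(f"{letter}. {choice}")
    return "\n".join(lines)


def answer_letter(record: dict) -> str:
    """Map the integer answer index to its option letter."""
    return LETTERS[record["answer"]]


# Invented sample record with the same three fields as the dataset.
sample = {
    "question": "2 + 2 = ?",
    "choices": ["3", "4", "5", "6"],
    "answer": 1,
}

print(format_question(sample))
print("Answer:", answer_letter(sample))
```

Keeping the letter mapping in one place makes it easy to compare a model's predicted letter against the stored integer index.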

Each configuration also has the following data splits:

  • dev: development set
  • test: test set
  • validation: validation set
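
A minimal sketch of loading one configuration and split with the Hugging Face `datasets` library, assuming the library is installed and the Hub is reachable. `load_subject` and `split_path_pattern` are hypothetical helper names. The latter only mirrors the `<config>/<split>-*` file layout declared in the card's metadata.

```python
SPLITS = ("dev", "test", "validation")


def split_path_pattern(config: str, split: str) -> str:
    """Return the data-file glob for a config and split, e.g. 'anatomy/dev-*'."""
    if split not in SPLITS:
        raise ValueError(f"unknown split {split!r}; expected one of {SPLITS}")
    return f"{config}/{split}-*"


def load_subject(config: str, split: str = "test"):
    """Load one subject split from the Hub (requires network access)."""
    # Imported lazily so the helper above works without `datasets` installed.
    from datasets import load_dataset

    return load_dataset("malhajar/mmlu-tr", config, split=split)


print(split_path_pattern("anatomy", "dev"))
# Example call, left commented out because it downloads data:
# anatomy_test = load_subject("anatomy", "test")
```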

Dataset configuration details

Subject configuration list

  • abstract_algebra
  • anatomy
  • astronomy
  • business_ethics
  • clinical_knowledge
  • college_biology
  • college_chemistry
  • college_computer_science
  • college_mathematics
  • college_medicine
  • college_physics
  • computer_security
  • conceptual_physics
  • econometrics
  • electrical_engineering
  • elementary_mathematics
  • formal_logic
  • global_facts
  • high_school_biology
  • high_school_chemistry
  • high_school_computer_science
  • high_school_european_history
  • high_school_geography
  • high_school_government_and_politics
  • high_school_macroeconomics
  • high_school_mathematics
  • high_school_microeconomics
  • high_school_physics
  • high_school_psychology
  • high_school_statistics
  • high_school_us_history
  • high_school_world_history
  • human_aging
  • human_sexuality
  • international_law
  • jurisprudence
  • logical_fallacies
  • machine_learning
  • management
  • marketing
  • medical_genetics
  • miscellaneous
  • moral_disputes
  • moral_scenarios
  • nutrition
  • philosophy
  • prehistory
  • professional_accounting
  • professional_law
  • professional_medicine
  • professional_psychology
  • public_relations
  • security_studies
  • sociology
  • us_foreign_policy
  • virology
  • world_religions
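
Many of the configuration names above share a level prefix (`high_school_`, `college_`, `professional_`). The sketch below groups names by that prefix, which can be handy when reporting results per level. The grouping rule and the name `group_by_level` are assumptions made for illustration, and only a handful of names from the list are used.

```python
from collections import defaultdict

# Prefixes observed in the configuration names listed above.
LEVEL_PREFIXES = ("high_school_", "college_", "professional_")


def group_by_level(config_names):
    """Group config names by level prefix; names without one go under 'other'."""
    groups = defaultdict(list)
    for name in config_names:
        for prefix in LEVEL_PREFIXES:
            if name.startswith(prefix):
                groups[prefix.rstrip("_")].append(name)
                break
        else:
            groups["other"].append(name)
    return dict(groups)


configs = [
    "abstract_algebra",
    "college_biology",
    "high_school_physics",
    "professional_law",
    "virology",
]
print(group_by_level(configs))
```

With network access, the full list of names could instead be fetched with `datasets.get_dataset_config_names("malhajar/mmlu-tr")`.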

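Data split details

The data splits for each subject configuration are as follows:

  • dev: development set, containing 4 examples.
  • test: test set; the number of examples varies by subject.
  • validation: validation set; the number of examples varies by subject.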
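
As a small worked example of how the split sizes add up, the sketch below totals the example counts per split for three configurations. The counts are copied from the card metadata for `abstract_algebra`, `anatomy`, and `astronomy`; `SPLIT_SIZES` and `total_examples` are hypothetical names.

```python
# Example counts per split, copied from the card metadata for three configs.
SPLIT_SIZES = {
    "abstract_algebra": {"dev": 4, "test": 99, "validation": 10},
    "anatomy": {"dev": 4, "test": 134, "validation": 13},
    "astronomy": {"dev": 4, "test": 151, "validation": 15},
}


def total_examples(sizes: dict, split: str) -> int:
    """Sum the number of examples in `split` across all configs in `sizes`."""
    return sum(per_config[split] for per_config in sizes.values())


for split in ("dev", "test", "validation"):
    print(split, total_examples(SPLIT_SIZES, split))
```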

示例数据分割详情

  • abstract_algebra:

    • dev: 4 examples, 1213 bytes
    • test: 99 examples, 30380 bytes
    • validation: 10 examples, 2990 bytes
  • anatomy:

    • dev: 4 examples, 1023 bytes
    • test: 134 examples, 44968 bytes
    • validation: 13 examples, 4074 bytes
  • astronomy:

    • dev: 4 examples, 2775 bytes
    • test: 151 examples, 72243 bytes
    • validation: 15 examples, 6884 bytes
  • business_ethics:

    • dev: 4 examples, 2754 bytes
    • test: 99 examples, 47509 bytes
    • validation: 10 examples, 4131 bytes
  • clinical_knowledge:

    • dev: 4 examples, 1615 bytes
    • test: 264 examples, 92165 bytes
    • validation: 28 examples, 9846 bytes
  • college_biology:

    • dev: 4 examples, 1514 bytes
    • test: 143 examples, 70502 bytes
    • validation: 15 examples, 7086 bytes
  • college_chemistry:

    • dev: 4 examples, 1350 bytes
    • test: 99 examples, 35099 bytes
    • validation: 7 examples, 2807 bytes
  • college_computer_science:

    • dev: 4 examples, 3582 bytes
    • test: 99 examples, 64366 bytes
    • validation: 10 examples, 6475 bytes
  • college_mathematics:

    • dev: 4 examples, 1696 bytes
    • test: 99 examples, 35750 bytes
    • validation: 10 examples, 3410 bytes
  • college_medicine:

    • dev: 4 examples, 2090 bytes
    • test: 172 examples, 119254 bytes
    • validation: 21 examples, 10820 bytes
  • college_physics:

    • dev: 4 examples, 1504 bytes
    • test: 101 examples, 41574 bytes
    • validation: 10 examples, 4353 bytes
  • computer_security:

    • dev: 4 examples, 1681 bytes
    • test: 99 examples, 43455 bytes
    • validation: 10 examples, 6697 bytes
  • conceptual_physics:

    • dev: 4 examples, 1210 bytes
    • test: 234 examples, 63735 bytes
    • validation: 25 examples, 6752 bytes
  • econometrics:

    • dev: 4 examples, 1997 bytes
    • test: 113 examples, 65356 bytes
    • validation: 11 examples, 6793 bytes
  • electrical_engineering:

    • dev: 4 examples, 1207 bytes
    • test: 144 examples, 38344 bytes
    • validation: 15 examples, 4115 bytes
  • elementary_mathematics:

    • dev: 4 examples, 1728 bytes
    • test: 377 examples, 100660 bytes
    • validation: 40 examples, 12903 bytes
  • formal_logic:

    • dev: 4 examples, 2365 bytes
    • test: 125 examples, 73028 bytes
    • validation: 13 examples, 8768 bytes
  • global_facts:

    • dev: 4 examples, 1118 bytes
    • test: 99 examples, 29486 bytes
    • validation: 9 examples, 2736 bytes
  • high_school_biology:

    • dev: 4 examples, 2162 bytes
    • test: 309 examples, 156715 bytes
    • validation: 31 examples, 14527 bytes
  • high_school_chemistry:

    • dev: 4 examples, 1656 bytes
    • test: 202 examples, 82374 bytes
    • validation: 21 examples, 9753 bytes
  • high_school_computer_science:

    • dev: 4 examples, 3770 bytes
    • test: 99 examples, 67680 bytes
    • validation: 8 examples, 4744 bytes
  • high_school_european_history:

    • dev: 4 examples, 13380 bytes
    • test: 164 examples, 379904 bytes
    • validation: 17 examples, 38640 bytes
  • high_school_geography:

    • dev: 4 examples, 1903 bytes
    • test: 197 examples, 64542 bytes
    • validation: 21 examples, 6151 bytes
  • high_school_government_and_politics:

    • dev: 4 examples, 1931 bytes
    • test: 192 examples, 98507 bytes
    • validation: 20 examples, 9710 bytes
  • high_school_macroeconomics:

    • dev: 4 examples, 1568 bytes
    • test: 389 examples, 175522 bytes
    • validation: 42 examples, 18938 bytes
  • high_school_mathematics:

    • dev: 4 examples, 1183 bytes
    • test: 269 examples, 76921 bytes
    • validation: 28 examples, 7961 bytes
  • high_school_microeconomics:

    • dev: 4 examples, 1197 bytes
    • test: 237 examples, 110403 bytes
    • validation: 25 examples, 10736 bytes
  • high_school_physics:

    • dev: 4 examples, 1789 bytes
    • test: 150 examples, 84860 bytes
    • validation: 16 examples, 8807 bytes
  • high_school_psychology:

    • dev: 4 examples, 2191 bytes
    • test: 544 examples, 237454 bytes
    • validation: 59 examples, 25261 bytes
  • high_school_statistics:

    • dev: 4 examples, 2829 bytes
    • test: 215 examples, 160308 bytes
    • validation: 22 examples, 14465 bytes
  • high_school_us_history:

    • dev: 4 examples, 11136 bytes
    • test: 203 examples, 427246 bytes
    • validation: 21 examples, 44180 bytes
  • high_school_world_history:

    • dev: 4 examples, 6339 bytes
    • test: 236 examples, 544262 bytes
    • validation: 25 examples, 63826 bytes
  • human_aging:

    • dev: 4 examples, 1301 bytes
    • test: 222 examples, 72894 bytes
    • validation: 22 examples, 7047 bytes
  • human_sexuality:

    • dev: 4 examples, 1286 bytes
    • test: 130 examples, 46845 bytes
    • validation: 11 examples, 3231 bytes
  • international_law:

    • dev: 4 examples, 2841 bytes
    • test: 120 examples, 78414 bytes
    • validation: 12 examples, 8742 bytes
  • jurisprudence:

    • dev: 4 examples, 1336 bytes
    • test: 107 examples, 49177 bytes
    • validation: 10 examples, 5453 bytes
  • logical_fallacies:

    • dev: 4 examples, 1958 bytes
    • test: 162 examples, 76985 bytes
    • validation: 17 examples, 7516 bytes
  • machine_learning:

    • dev: 4 examples, 3179 bytes
    • test: 111 examples, 54414 bytes
    • validation: 10 examples, 4357 bytes
  • management:

    • dev: 4 examples, 870 bytes
    • test: 102 examples, 29869 bytes
    • validation: 10 examples, 2530 bytes
  • marketing:

    • dev: 4 examples, 1977 bytes
    • test: 233 examples, 95368 bytes
    • validation: 24 examples, 10670 bytes
  • medical_genetics:

    • dev: 4 examples, 1297 bytes
    • test: 99 examples, 29741 bytes
    • validation: 10 examples, 3815 bytes
  • miscellaneous:

    • dev: 4 examples, 874 bytes
    • test: 782 examples, 223389 bytes
    • validation: 85 examples, 21001 bytes
  • moral_disputes:

    • dev: 4 examples, 1842 bytes
    • test: 345 examples, 165916 bytes
    • validation: 37 examples, 18415 bytes
  • moral_scenarios:

    • dev: 4 examples, 2582 bytes
    • test: 894 examples, 614251 bytes
    • validation: 99 examples, 68302 bytes
  • nutrition:

    • dev: 4 examples, 2212 bytes
    • test: 305 examples, 135605 bytes
    • validation: 32 examples, 11919 bytes
  • philosophy:

    • dev: 4 examples, 997 bytes
    • test: 310 examples, 121539 bytes
    • validation: 33 examples, 12763 bytes
  • prehistory:

    • dev: 4 examples, 2269 bytes
    • test: 323 examples, 132441 bytes
    • validation: 34 examples, 15041 bytes
  • professional_accounting:

    • dev: 4 examples, 2409 bytes
    • test: 281 examples, 178410 bytes
    • validation: 30 examples, 20331 bytes
  • professional_law:

    • dev: 4 examples, 7449 bytes
    • test: 1533 examples, 2730513 bytes
    • validation: 169 examples, 294872 bytes
  • professional_medicine:

    • dev: 4 examples, 3669 bytes
    • test: 271 examples, 298852 bytes
    • validation: 30 examples, 31340 bytes
  • professional_psychology:

    • dev: 4 examples, 1936 bytes
    • test: 611 examples, 337821 bytes
    • validation: 68 examples, 43121 bytes
  • public_relations:

    • dev: 4 examples, 1592 bytes
    • test: 109 examples, 42078 bytes
    • validation: 11 examples, 6406 bytes
  • security_studies:

    • dev: 4 examples, 5725 bytes
    • test: 244 examples, 30