five

erhwenkuo/ceval-exam-zhtw

收藏
Hugging Face2023-10-10 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/erhwenkuo/ceval-exam-zhtw
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: accountant features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 177004 num_examples: 443 - name: val num_bytes: 19555 num_examples: 49 - name: dev num_bytes: 3414 num_examples: 5 download_size: 151561 dataset_size: 199973 - config_name: advanced_mathematics features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 50031 num_examples: 173 - name: val num_bytes: 5331 num_examples: 19 - name: dev num_bytes: 7021 num_examples: 5 download_size: 50945 dataset_size: 62383 - config_name: art_studies features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 41230 num_examples: 298 - name: val num_bytes: 4581 num_examples: 33 - name: dev num_bytes: 1439 num_examples: 5 download_size: 46573 dataset_size: 47250 - config_name: basic_medicine features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 28820 num_examples: 175 - name: val num_bytes: 2627 num_examples: 19 - name: dev num_bytes: 1825 num_examples: 5 download_size: 37502 dataset_size: 33272 - config_name: business_administration features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 78396 num_examples: 301 - name: val num_bytes: 9225 num_examples: 33 - name: dev num_bytes: 3155 num_examples: 5 download_size: 75404 dataset_size: 90776 - config_name: chinese_language_and_literature features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 32328 num_examples: 209 - name: val num_bytes: 3446 num_examples: 23 - name: dev num_bytes: 1892 num_examples: 5 download_size: 43537 dataset_size: 37666 - config_name: civil_servant features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 181519 num_examples: 429 - name: val num_bytes: 21273 num_examples: 47 - name: dev num_bytes: 4576 num_examples: 5 download_size: 180536 dataset_size: 207368 - config_name: clinical_medicine features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 42161 num_examples: 200 - name: val num_bytes: 4167 num_examples: 22 - name: dev num_bytes: 1951 num_examples: 5 download_size: 48783 dataset_size: 48279 - config_name: college_chemistry features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 45801 num_examples: 224 - name: val num_bytes: 4443 num_examples: 24 - name: dev num_bytes: 3611 num_examples: 5 download_size: 53682 dataset_size: 53855 - config_name: college_economics features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 119746 num_examples: 497 - name: val num_bytes: 14461 num_examples: 55 - name: dev num_bytes: 3673 num_examples: 5 download_size: 106480 dataset_size: 137880 - config_name: college_physics features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 55731 num_examples: 176 - name: val num_bytes: 6145 num_examples: 19 - name: dev num_bytes: 3824 num_examples: 5 download_size: 62806 dataset_size: 65700 - config_name: college_programming features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 84024 num_examples: 342 - name: val num_bytes: 9615 num_examples: 37 - name: dev num_bytes: 2900 num_examples: 5 download_size: 83274 dataset_size: 96539 - config_name: computer_architecture features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 41173 num_examples: 193 - name: val num_bytes: 4188 num_examples: 21 - name: dev num_bytes: 2841 num_examples: 5 download_size: 48203 dataset_size: 48202 - config_name: computer_network features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 35495 num_examples: 171 - name: val num_bytes: 3814 num_examples: 19 - name: dev num_bytes: 2364 num_examples: 5 download_size: 43988 dataset_size: 41673 - config_name: discrete_mathematics features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 36057 num_examples: 153 - name: val num_bytes: 3424 num_examples: 16 - name: dev num_bytes: 2002 num_examples: 5 download_size: 43029 dataset_size: 41483 - config_name: education_science features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 55756 num_examples: 270 - name: val num_bytes: 5522 num_examples: 29 - name: dev num_bytes: 3093 num_examples: 5 download_size: 59946 dataset_size: 64371 - config_name: electrical_engineer features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 73769 num_examples: 339 - name: val num_bytes: 8327 num_examples: 37 - name: dev num_bytes: 2180 num_examples: 5 download_size: 74147 dataset_size: 84276 - config_name: environmental_impact_assessment_engineer features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 84701 num_examples: 281 - name: val num_bytes: 9186 num_examples: 31 - name: dev num_bytes: 2495 num_examples: 5 download_size: 73813 dataset_size: 96382 - config_name: fire_engineer features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 83743 num_examples: 282 - name: val num_bytes: 10016 num_examples: 31 - name: dev num_bytes: 2209 num_examples: 5 download_size: 82070 dataset_size: 95968 - config_name: high_school_biology features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 55242 num_examples: 175 - name: val num_bytes: 6105 num_examples: 19 - name: dev num_bytes: 2164 num_examples: 5 download_size: 60835 dataset_size: 63511 - config_name: high_school_chemistry features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 46918 num_examples: 172 - name: val num_bytes: 5625 num_examples: 19 - name: dev num_bytes: 2576 num_examples: 5 download_size: 55719 dataset_size: 55119 - config_name: high_school_chinese features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 110380 num_examples: 178 - name: val num_bytes: 10475 num_examples: 19 - name: dev num_bytes: 5290 num_examples: 5 download_size: 120269 dataset_size: 126145 - config_name: high_school_geography features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 41232 num_examples: 178 - name: val num_bytes: 3985 num_examples: 19 - name: dev num_bytes: 2087 num_examples: 5 download_size: 50092 dataset_size: 47304 - config_name: high_school_history features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 56205 num_examples: 182 - name: val num_bytes: 6624 num_examples: 20 - name: dev num_bytes: 2421 num_examples: 5 download_size: 68561 dataset_size: 65250 - config_name: high_school_mathematics features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 41095 num_examples: 166 - name: val num_bytes: 5144 num_examples: 18 - name: dev num_bytes: 3552 num_examples: 5 download_size: 53179 dataset_size: 49791 - config_name: high_school_physics features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 61682 num_examples: 175 - name: val num_bytes: 7266 num_examples: 19 - name: dev num_bytes: 2266 num_examples: 5 download_size: 66481 dataset_size: 71214 - config_name: high_school_politics features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 83428 num_examples: 176 - name: val num_bytes: 8912 num_examples: 19 - name: dev num_bytes: 4730 num_examples: 5 download_size: 90433 dataset_size: 97070 - config_name: ideological_and_moral_cultivation features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 35315 num_examples: 172 - name: val num_bytes: 3241 num_examples: 19 - name: dev num_bytes: 1296 num_examples: 5 download_size: 41159 dataset_size: 39852 - config_name: law features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 79806 num_examples: 221 - name: val num_bytes: 8119 num_examples: 24 - name: dev num_bytes: 4142 num_examples: 5 download_size: 83236 dataset_size: 92067 - config_name: legal_professional features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 122000 num_examples: 215 - name: val num_bytes: 12215 num_examples: 23 - name: dev num_bytes: 6974 num_examples: 5 download_size: 125256 dataset_size: 141189 - config_name: logic features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 144288 num_examples: 204 - name: val num_bytes: 15558 num_examples: 22 - name: dev num_bytes: 5641 num_examples: 5 download_size: 142564 dataset_size: 165487 - config_name: mao_zedong_thought features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 56708 num_examples: 219 - name: val num_bytes: 5487 num_examples: 24 - name: dev num_bytes: 3352 num_examples: 5 download_size: 57948 dataset_size: 65547 - config_name: marxism features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 38674 num_examples: 179 - name: val num_bytes: 4251 num_examples: 19 - name: dev num_bytes: 2142 num_examples: 5 download_size: 44933 dataset_size: 45067 - config_name: metrology_engineer features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 47544 num_examples: 219 - name: val num_bytes: 6134 num_examples: 24 - name: dev num_bytes: 2485 num_examples: 5 download_size: 54828 dataset_size: 56163 - config_name: middle_school_biology features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 47267 num_examples: 192 - name: val num_bytes: 5263 num_examples: 21 - name: dev num_bytes: 4327 num_examples: 5 download_size: 58472 dataset_size: 56857 - config_name: middle_school_chemistry features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 47575 num_examples: 185 - name: val num_bytes: 5654 num_examples: 20 - name: dev num_bytes: 3866 num_examples: 5 download_size: 59099 dataset_size: 57095 - config_name: middle_school_geography features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 23332 num_examples: 108 - name: val num_bytes: 2641 num_examples: 12 - name: dev num_bytes: 2148 num_examples: 5 download_size: 37389 dataset_size: 28121 - config_name: middle_school_history features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 47076 num_examples: 207 - name: val num_bytes: 5990 num_examples: 22 - name: dev num_bytes: 2014 num_examples: 5 download_size: 56042 dataset_size: 55080 - config_name: middle_school_mathematics features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 33142 num_examples: 177 - name: val num_bytes: 4897 num_examples: 19 - name: dev num_bytes: 3187 num_examples: 5 download_size: 44657 dataset_size: 41226 - config_name: middle_school_physics features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 48796 num_examples: 178 - name: val num_bytes: 5279 num_examples: 19 - name: dev num_bytes: 3531 num_examples: 5 download_size: 59820 dataset_size: 57606 - config_name: middle_school_politics features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 72499 num_examples: 193 - name: val num_bytes: 7326 num_examples: 21 - name: dev num_bytes: 3687 num_examples: 5 download_size: 76847 dataset_size: 83512 - config_name: modern_chinese_history features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 51247 num_examples: 212 - name: val num_bytes: 5188 num_examples: 23 - name: dev num_bytes: 2983 num_examples: 5 download_size: 59728 dataset_size: 59418 - config_name: operating_system features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 31467 num_examples: 179 - name: val num_bytes: 3335 num_examples: 19 - name: dev num_bytes: 2611 num_examples: 5 download_size: 40349 dataset_size: 37413 - config_name: physician features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 89819 num_examples: 443 - name: val num_bytes: 8713 num_examples: 49 - name: dev num_bytes: 2033 num_examples: 5 download_size: 91464 dataset_size: 100565 - config_name: plant_protection features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 31877 num_examples: 199 - name: val num_bytes: 3634 num_examples: 22 - name: dev num_bytes: 3726 num_examples: 5 download_size: 42813 dataset_size: 39237 - config_name: probability_and_statistics features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 56749 num_examples: 166 - name: val num_bytes: 5781 num_examples: 18 - name: dev num_bytes: 6769 num_examples: 5 download_size: 63258 dataset_size: 69299 - config_name: professional_tour_guide features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 41231 num_examples: 266 - name: val num_bytes: 4509 num_examples: 29 - name: dev num_bytes: 1764 num_examples: 5 download_size: 51642 dataset_size: 47504 - config_name: sports_science features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 32536 num_examples: 180 - name: val num_bytes: 3493 num_examples: 19 - name: dev num_bytes: 4182 num_examples: 5 download_size: 45905 dataset_size: 40211 - config_name: tax_accountant features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 174509 num_examples: 443 - name: val num_bytes: 18938 num_examples: 49 - name: dev num_bytes: 4274 num_examples: 5 download_size: 148037 dataset_size: 197721 - config_name: teacher_qualification features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 107372 num_examples: 399 - name: val num_bytes: 12220 num_examples: 44 - name: dev num_bytes: 3212 num_examples: 5 download_size: 105439 dataset_size: 122804 - config_name: urban_and_rural_planner features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 110473 num_examples: 418 - name: val num_bytes: 12793 num_examples: 46 - name: dev num_bytes: 3184 num_examples: 5 download_size: 101932 dataset_size: 126450 - config_name: veterinary_medicine features: - name: id dtype: int32 - name: question dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: answer dtype: string - name: explanation dtype: string splits: - name: test num_bytes: 39465 num_examples: 210 - name: val num_bytes: 4562 num_examples: 23 - name: dev num_bytes: 2365 num_examples: 5 download_size: 48753 dataset_size: 46392 configs: - config_name: accountant data_files: - split: test path: accountant/test-* - split: val path: accountant/val-* - split: dev path: accountant/dev-* - config_name: advanced_mathematics data_files: - split: test path: advanced_mathematics/test-* - split: val path: advanced_mathematics/val-* - split: dev path: advanced_mathematics/dev-* - config_name: art_studies data_files: - split: test path: art_studies/test-* - split: val path: art_studies/val-* - split: dev path: art_studies/dev-* - config_name: basic_medicine data_files: - split: test path: basic_medicine/test-* - split: val path: basic_medicine/val-* - split: dev path: basic_medicine/dev-* - config_name: business_administration data_files: - split: test path: business_administration/test-* - split: val path: business_administration/val-* - split: dev path: business_administration/dev-* - config_name: chinese_language_and_literature data_files: - split: test path: chinese_language_and_literature/test-* - split: val path: chinese_language_and_literature/val-* - split: dev path: chinese_language_and_literature/dev-* - config_name: civil_servant data_files: - split: test path: civil_servant/test-* - split: val path: civil_servant/val-* - split: dev path: civil_servant/dev-* - config_name: clinical_medicine data_files: - split: test path: clinical_medicine/test-* - split: val path: clinical_medicine/val-* - split: dev path: clinical_medicine/dev-* - config_name: college_chemistry data_files: - split: test path: college_chemistry/test-* - split: val path: college_chemistry/val-* - split: dev path: college_chemistry/dev-* - config_name: college_economics data_files: - split: test path: college_economics/test-* - split: val path: college_economics/val-* - split: dev path: college_economics/dev-* - config_name: college_physics data_files: - split: test path: college_physics/test-* - split: val path: college_physics/val-* - split: dev path: college_physics/dev-* - config_name: college_programming data_files: - split: test path: college_programming/test-* - split: val path: college_programming/val-* - split: dev path: college_programming/dev-* - config_name: computer_architecture data_files: - split: test path: computer_architecture/test-* - split: val path: computer_architecture/val-* - split: dev path: computer_architecture/dev-* - config_name: computer_network data_files: - split: test path: computer_network/test-* - split: val path: computer_network/val-* - split: dev path: computer_network/dev-* - config_name: discrete_mathematics data_files: - split: test path: discrete_mathematics/test-* - split: val path: discrete_mathematics/val-* - split: dev path: discrete_mathematics/dev-* - config_name: education_science data_files: - split: test path: education_science/test-* - split: val path: education_science/val-* - split: dev path: education_science/dev-* - config_name: electrical_engineer data_files: - split: test path: electrical_engineer/test-* - split: val path: electrical_engineer/val-* - split: dev path: electrical_engineer/dev-* - config_name: environmental_impact_assessment_engineer data_files: - split: test path: environmental_impact_assessment_engineer/test-* - split: val path: environmental_impact_assessment_engineer/val-* - split: dev path: environmental_impact_assessment_engineer/dev-* - config_name: fire_engineer data_files: - split: test path: fire_engineer/test-* - split: val path: fire_engineer/val-* - split: dev path: fire_engineer/dev-* - config_name: high_school_biology data_files: - split: test path: high_school_biology/test-* - split: val path: high_school_biology/val-* - split: dev path: high_school_biology/dev-* - config_name: high_school_chemistry data_files: - split: test path: high_school_chemistry/test-* - split: val path: high_school_chemistry/val-* - split: dev path: high_school_chemistry/dev-* - config_name: high_school_chinese data_files: - split: test path: high_school_chinese/test-* - split: val path: high_school_chinese/val-* - split: dev path: high_school_chinese/dev-* - config_name: high_school_geography data_files: - split: test path: high_school_geography/test-* - split: val path: high_school_geography/val-* - split: dev path: high_school_geography/dev-* - config_name: high_school_history data_files: - split: test path: high_school_history/test-* - split: val path: high_school_history/val-* - split: dev path: high_school_history/dev-* - config_name: high_school_mathematics data_files: - split: test path: high_school_mathematics/test-* - split: val path: high_school_mathematics/val-* - split: dev path: high_school_mathematics/dev-* - config_name: high_school_physics data_files: - split: test path: high_school_physics/test-* - split: val path: high_school_physics/val-* - split: dev path: high_school_physics/dev-* - config_name: high_school_politics data_files: - split: test path: high_school_politics/test-* - split: val path: high_school_politics/val-* - split: dev path: high_school_politics/dev-* - config_name: ideological_and_moral_cultivation data_files: - split: test path: ideological_and_moral_cultivation/test-* - split: val path: ideological_and_moral_cultivation/val-* - split: dev path: ideological_and_moral_cultivation/dev-* - config_name: law data_files: - split: test path: law/test-* - split: val path: law/val-* - split: dev path: law/dev-* - config_name: legal_professional data_files: - split: test path: legal_professional/test-* - split: val path: legal_professional/val-* - split: dev path: legal_professional/dev-* - config_name: logic data_files: - split: test path: logic/test-* - split: val path: logic/val-* - split: dev path: logic/dev-* - config_name: mao_zedong_thought data_files: - split: test path: mao_zedong_thought/test-* - split: val path: mao_zedong_thought/val-* - split: dev path: mao_zedong_thought/dev-* - config_name: marxism data_files: - split: test path: marxism/test-* - split: val path: marxism/val-* - split: dev path: marxism/dev-* - config_name: metrology_engineer data_files: - split: test path: metrology_engineer/test-* - split: val path: metrology_engineer/val-* - split: dev path: metrology_engineer/dev-* - config_name: middle_school_biology data_files: - split: test path: middle_school_biology/test-* - split: val path: middle_school_biology/val-* - split: dev path: middle_school_biology/dev-* - config_name: middle_school_chemistry data_files: - split: test path: middle_school_chemistry/test-* - split: val path: middle_school_chemistry/val-* - split: dev path: middle_school_chemistry/dev-* - config_name: middle_school_geography data_files: - split: test path: middle_school_geography/test-* - split: val path: middle_school_geography/val-* - split: dev path: middle_school_geography/dev-* - config_name: middle_school_history data_files: - split: test path: middle_school_history/test-* - split: val path: middle_school_history/val-* - split: dev path: middle_school_history/dev-* - config_name: middle_school_mathematics data_files: - split: test path: middle_school_mathematics/test-* - split: val path: middle_school_mathematics/val-* - split: dev path: middle_school_mathematics/dev-* - config_name: middle_school_physics data_files: - split: test path: middle_school_physics/test-* - split: val path: middle_school_physics/val-* - split: dev path: middle_school_physics/dev-* - config_name: middle_school_politics data_files: - split: test path: middle_school_politics/test-* - split: val path: middle_school_politics/val-* - split: dev path: middle_school_politics/dev-* - config_name: modern_chinese_history data_files: - split: test path: modern_chinese_history/test-* - split: val path: modern_chinese_history/val-* - split: dev path: modern_chinese_history/dev-* - config_name: operating_system data_files: - split: test path: operating_system/test-* - split: val path: operating_system/val-* - split: dev path: operating_system/dev-* - config_name: physician data_files: - split: test path: physician/test-* - split: val path: physician/val-* - split: dev path: physician/dev-* - config_name: plant_protection data_files: - split: test path: plant_protection/test-* - split: val path: plant_protection/val-* - split: dev path: plant_protection/dev-* - config_name: probability_and_statistics data_files: - split: test path: probability_and_statistics/test-* - split: val path: probability_and_statistics/val-* - split: dev path: probability_and_statistics/dev-* - config_name: professional_tour_guide data_files: - split: test path: professional_tour_guide/test-* - split: val path: professional_tour_guide/val-* - split: dev path: professional_tour_guide/dev-* - config_name: sports_science data_files: - split: test path: sports_science/test-* - split: val path: sports_science/val-* - split: dev path: sports_science/dev-* - config_name: tax_accountant data_files: - split: test path: tax_accountant/test-* - split: val path: tax_accountant/val-* - split: dev path: tax_accountant/dev-* - config_name: teacher_qualification data_files: - split: test path: teacher_qualification/test-* - split: val path: teacher_qualification/val-* - split: dev path: teacher_qualification/dev-* - config_name: urban_and_rural_planner data_files: - split: test path: urban_and_rural_planner/test-* - split: val path: urban_and_rural_planner/val-* - split: dev path: urban_and_rural_planner/dev-* - config_name: veterinary_medicine data_files: - split: test path: veterinary_medicine/test-* - split: val path: veterinary_medicine/val-* - split: dev path: veterinary_medicine/dev-* license: cc language: - zh tags: - '"llm-eval"' --- # Dataset Card for "ceval-exam-zhtw" C-Eval 是一個針對基礎模型的綜合中文評估套件。它由 13,948 道多項選擇題組成,涵蓋 52 個不同的學科和四個難度級別。[原始網站](https://cevalbenchmark.com/)和 [GitHub](https://github.com/SJTU-LIT/ceval/tree/main) 或查看[論文](https://arxiv.org/abs/2305.08322)以了解更多詳細資訊。 C-Eval 主要的數據都是使用簡體中文來撰寫并且用來評測簡體中文的 LLM 的效能來設計的,本數據集使用 OpenCC 來進行簡繁的中文轉換,主要目的方便繁中 LLM 的開發與驗測。 ## 下載 使用 Hugging Face `datasets` 直接載入資料集: ```python from datasets import load_dataset dataset=load_dataset(r"erhwenkuo/ceval-exam-zhtw",name="computer_network") print(dataset['val'][0]) # {'id': 0, 'question': '使用位填充方法,以01111110為位首flag,資料為011011111111111111110010,求問傳送時要新增幾個0____', 'A': '1', 'B': '2', 'C': '3', 'D': '4', 'answer': 'C', 'explanation': ''} ``` ## 授權 C-Eval 資料集根據 Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License 授權。 ## Citation 如果您使用這個資料集,請引用原始 C-Eval 的論文。 ``` @article{huang2023ceval, title={C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models}, author={Huang, Yuzhen and Bai, Yuzhuo and Zhu, Zhihao and Zhang, Junlei and Zhang, Jinghan and Su, Tangjun and Liu, Junteng and Lv, Chuancheng and Zhang, Yikai and Lei, Jiayi and Fu, Yao and Sun, Maosong and He, Junxian}, journal={arXiv preprint arXiv:2305.08322}, year={2023} } ```
提供机构:
erhwenkuo
原始信息汇总

数据集概述

该数据集包含多个配置,每个配置代表不同学科的题目。每个配置包含以下字段:

  • id: 题目ID,数据类型为int32。
  • question: 题目内容,数据类型为string。
  • A: 选项A,数据类型为string。
  • B: 选项B,数据类型为string。
  • C: 选项C,数据类型为string。
  • D: 选项D,数据类型为string。
  • answer: 正确答案,数据类型为string。
  • explanation: 答案解释,数据类型为string。

每个配置还包含以下数据分割:

  • test: 测试集,包含字节数和样本数。
  • val: 验证集,包含字节数和样本数。
  • dev: 开发集,包含字节数和样本数。

每个配置还提供了下载大小和数据集大小。

具体配置信息

  1. accountant

    • 下载大小: 151561字节
    • 数据集大小: 199973字节
    • 分割:
      • test: 177004字节, 443样本
      • val: 19555字节, 49样本
      • dev: 3414字节, 5样本
  2. advanced_mathematics

    • 下载大小: 50945字节
    • 数据集大小: 62383字节
    • 分割:
      • test: 50031字节, 173样本
      • val: 5331字节, 19样本
      • dev: 7021字节, 5样本
  3. art_studies

    • 下载大小: 46573字节
    • 数据集大小: 47250字节
    • 分割:
      • test: 41230字节, 298样本
      • val: 4581字节, 33样本
      • dev: 1439字节, 5样本
  4. basic_medicine

    • 下载大小: 37502字节
    • 数据集大小: 33272字节
    • 分割:
      • test: 28820字节, 175样本
      • val: 2627字节, 19样本
      • dev: 1825字节, 5样本
  5. business_administration

    • 下载大小: 75404字节
    • 数据集大小: 90776字节
    • 分割:
      • test: 78396字节, 301样本
      • val: 9225字节, 33样本
      • dev: 3155字节, 5样本
  6. chinese_language_and_literature

    • 下载大小: 43537字节
    • 数据集大小: 37666字节
    • 分割:
      • test: 32328字节, 209样本
      • val: 3446字节, 23样本
      • dev: 1892字节, 5样本
  7. civil_servant

    • 下载大小: 180536字节
    • 数据集大小: 207368字节
    • 分割:
      • test: 181519字节, 429样本
      • val: 21273字节, 47样本
      • dev: 4576字节, 5样本
  8. clinical_medicine

    • 下载大小: 48783字节
    • 数据集大小: 48279字节
    • 分割:
      • test: 42161字节, 200样本
      • val: 4167字节, 22样本
      • dev: 1951字节, 5样本
  9. college_chemistry

    • 下载大小: 53682字节
    • 数据集大小: 53855字节
    • 分割:
      • test: 45801字节, 224样本
      • val: 4443字节, 24样本
      • dev: 3611字节, 5样本
  10. college_economics

    • 下载大小: 106480字节
    • 数据集大小: 137880字节
    • 分割:
      • test: 119746字节, 497样本
      • val: 14461字节, 55样本
      • dev: 3673字节, 5样本
  11. college_physics

    • 下载大小: 62806字节
    • 数据集大小: 65700字节
    • 分割:
      • test: 55731字节, 176样本
      • val: 6145字节, 19样本
      • dev: 3824字节, 5样本
  12. college_programming

    • 下载大小: 83274字节
    • 数据集大小: 96539字节
    • 分割:
      • test: 84024字节, 342样本
      • val: 9615字节, 37样本
      • dev: 2900字节, 5样本
  13. computer_architecture

    • 下载大小: 48203字节
    • 数据集大小: 48202字节
    • 分割:
      • test: 41173字节, 193样本
      • val: 4188字节, 21样本
      • dev: 2841字节, 5样本
  14. computer_network

    • 下载大小: 43988字节
    • 数据集大小: 41673字节
    • 分割:
      • test: 35495字节, 171样本
      • val: 3814字节, 19样本
      • dev: 2364字节, 5样本
  15. discrete_mathematics

    • 下载大小: 43029字节
    • 数据集大小: 41483字节
    • 分割:
      • test: 36057字节, 153样本
      • val: 3424字节, 16样本
      • dev: 2002字节, 5样本
  16. education_science

    • 下载大小: 59946字节
    • 数据集大小: 64371字节
    • 分割:
      • test: 55756字节, 270样本
      • val: 5522字节, 29样本
      • dev: 3093字节, 5样本
  17. electrical_engineer

    • 下载大小: 74147字节
    • 数据集大小: 84276字节
    • 分割:
      • test: 73769字节, 339样本
      • val: 8327字节, 37样本
      • dev: 2180字节, 5样本
  18. environmental_impact_assessment_engineer

    • 下载大小: 73813字节
    • 数据集大小: 96382字节
    • 分割:
      • test: 84701字节, 281样本
      • val: 9186字节, 31样本
      • dev: 2495字节, 5样本
  19. fire_engineer

    • 下载大小: 82070字节
    • 数据集大小: 95968字节
    • 分割:
      • test: 83743字节, 282样本
      • val: 10016字节, 31样本
      • dev: 2209字节, 5样本
  20. high_school_biology

    • 下载大小: 60835字节
    • 数据集大小: 63511字节
    • 分割:
      • test: 55242字节, 175样本
      • val: 6105字节, 19样本
      • dev: 2164字节, 5样本
  21. high_school_chemistry

    • 下载大小: 55719字节
    • 数据集大小: 55119字节
    • 分割:
      • test: 46918字节, 172样本
      • val: 5625字节, 19样本
      • dev: 2576字节, 5样本
  22. high_school_chinese

    • 下载大小: 120269字节
    • 数据集大小: 126145字节
    • 分割:
      • test: 110380字节, 178样本
      • val: 10475字节, 19样本
      • dev: 5290字节, 5样本
  23. high_school_geography

    • 下载大小: 50092字节
    • 数据集大小: 47304字节
    • 分割:
      • test: 41232字节, 178样本
      • val: 3985字节, 19样本
      • dev: 2087字节, 5样本
  24. high_school_history

    • 下载大小: 68561字节
    • 数据集大小: 65250字节
    • 分割:
      • test: 56205字节, 182样本
      • val: 6624字节, 20样本
      • dev: 2421字节, 5样本
  25. high_school_mathematics

    • 下载大小: 53179字节
    • 数据集大小: 49791字节
    • 分割:
      • test: 41095字节, 166样本
      • val: 5144字节, 18样本
      • dev: 3552字节, 5样本
  26. high_school_physics

    • 下载大小: 66481字节
    • 数据集大小: 71214字节
    • 分割:
      • test: 61682字节, 175样本
      • val: 7266字节, 19样本
      • dev: 2266字节, 5样本
  27. high_school_politics

    • 下载大小: 90433字节
    • 数据集大小: 97070字节
    • 分割:
      • test: 83428字节, 176样本
      • val: 8912字节, 19样本
      • dev: 4730字节, 5样本
  28. ideological_and_moral_cultivation

    • 下载大小: 41159字节
    • 数据集大小: 39852字节
    • 分割:
      • test: 35315字节, 172样本
      • val: 3241字节, 19样本
      • dev: 1296字节, 5样本
  29. law

    • 下载大小: 83236字节
    • 数据集大小: 92067字节
    • 分割:
      • test: 79806字节, 221样本
      • val: 8119字节, 24样本
      • dev: 4142字节, 5样本
  30. legal_professional

    • 下载大小: 125256字节
    • 数据集大小: 141189字节
    • 分割:
      • test: 122000字节, 215样本
      • val: 12215字节, 23样本
      • dev: 6974字节, 5样本
  31. logic

    • 下载大小: 142564字节
    • 数据集大小: 165487字节
    • 分割:
      • test: 144288字节, 204样本
      • val: 15558字节, 22样本
      • dev: 5641字节, 5样本
  32. mao_zedong_thought

    • 下载大小: 57948字节
    • 数据集大小: 65547字节
    • 分割:
      • test: 56708字节, 219样本
      • val: 5487字节, 24样本
      • dev: 3352字节, 5样本
  33. marxism

    • 下载大小: 44933字节
    • 数据集大小: 45067字节
    • 分割:
      • test: 38674字节, 179样本
      • val: 4251字节, 19样本
      • dev: 2142字节, 5样本
  34. metrology_engineer

    • 下载大小: 54828字节
    • 数据集大小: 56163字节
    • 分割:
      • test: 47544字节, 219样本
      • val: 6134字节, 24样本
      • dev: 2485字节, 5样本
  35. middle_school_biology

    • 下载大小: 58472字节
    • 数据集大小: 56857字节
    • 分割:
      • test: 47267字节, 192样本
      • val: 5263字节, 21样本
      • dev: 4327字节, 5样本
  36. middle_school_chemistry

    • 下载大小: 59099字节
    • 数据集大小: 57095字节
    • 分割:
      • test: 47575字节, 185样本
      • val: 5654字节, 20样本
      • dev: 3866字节, 5样本
  37. middle_school_geography

    • 下载大小: 37389字节
    • 数据集大小: 28121字节
    • 分割:
      • test: 23332字节, 108样本
      • val: 2641字节, 12样本
      • dev: 2148字节, 5样本
  38. middle_school_history

    • 下载大小: 37389字节
    • 数据集大小: 28121字节
    • 分割:
      • test: 23332字节, 108样本
      • val: 2641字节, 12样本
      • dev: 2148字节, 5样本
搜集汇总
数据集介绍
main_image_url
构建方式
在中文教育评估领域,该数据集通过系统化采集与整理,构建了一个涵盖多学科的中文考试题库。其构建过程涉及从会计学、高等数学、艺术研究等52个专业或学科领域中,精心筛选具有代表性的单项选择题。每道题目均包含标准化的结构,即问题描述、四个备选选项、正确答案及详细解析,确保了数据的规范性与完整性。数据集的划分遵循机器学习常规,为每个学科配置了测试集、验证集和开发集,便于模型训练与评估的顺利进行。
特点
该数据集展现了跨学科知识覆盖的广度与深度,囊括了从基础教育到高等专业领域的多元主题。其显著特点在于每个条目均配备了详尽的解析文本,这不仅提供了答案的依据,还增强了数据集的教育价值。数据以结构化格式存储,每个学科独立配置,便于针对性研究与分析。题目设计兼顾理论知识与实践应用,反映了真实考试场景中的复杂性与多样性,为模型能力评估提供了丰富而严谨的基准。
使用方法
在自然语言处理与教育技术研究中,该数据集主要用于评估模型在中文语境下的知识理解与推理能力。研究人员可加载特定学科的配置,利用测试集进行模型性能的量化分析。验证集可用于超参数调优,而开发集则支持初步实验与快速迭代。通过解析字段,模型不仅能学习答案预测,还可深入探究解题逻辑,促进可解释人工智能的发展。该数据集适用于多项选择问答、知识图谱构建及自适应学习系统的开发。
背景与挑战
背景概述
在人工智能与自然语言处理领域,大规模知识评估数据集对于衡量模型的多学科理解能力至关重要。erhwenkuo/ceval-exam-zhtw数据集应运而生,其构建源于对中文语境下专业学科知识进行系统性评测的需求。该数据集由研究人员erhwenkuo创建,旨在通过涵盖会计、高等数学、艺术学、基础医学等52个专业领域的多项选择题,评估模型在复杂知识推理与跨学科应用中的表现。其核心研究问题聚焦于探索模型在中文专业术语、逻辑推理及学科交叉背景下的准确性与泛化能力,为中文大语言模型的性能基准测试提供了重要工具,推动了教育技术与智能评估系统的发展。
当前挑战
该数据集面临的挑战主要体现在两个方面:在领域问题层面,多项选择题设计需平衡学科深度与广度,确保题目既能反映专业知识的核心难点,又避免歧义性,这对模型的知识覆盖与推理能力提出了严峻考验;同时,解释性文本的构建要求精准对应答案,以支撑模型的可解释性学习。在构建过程中,挑战包括从海量中文教育资源中筛选与标准化题目,保证数据来源的权威性与时效性,以及跨学科术语的统一标注,这些工作需克服数据异构性与专业壁垒,确保数据集的严谨性与实用性。
常用场景
经典使用场景
在自然语言处理领域,评估模型的知识理解与推理能力是核心任务之一。erhwenkuo/ceval-exam-zhtw数据集作为一套涵盖多学科的中文选择题库,其经典使用场景在于为大型语言模型提供标准化的知识评估基准。研究者通过该数据集能够系统性地测试模型在会计、高等数学、临床医学等52个专业领域的知识掌握程度,从而衡量模型在复杂语境下的逻辑推理与跨学科知识融合能力。
实际应用
在实际应用层面,该数据集为智能教育系统和专业资格认证工具的开发提供了关键数据支撑。教育科技公司可基于此构建自适应学习平台,通过分析模型在各学科试题上的表现,精准诊断知识薄弱环节并生成个性化学习路径。同时,在会计师、消防工程师等职业资格模拟考试系统中,该数据集的高质量试题与详细解析能够帮助备考者进行针对性训练,提升专业认证考试的通过效率与公平性。
衍生相关工作
围绕该数据集衍生的经典工作主要集中在知识增强型语言模型的构建与评估框架创新。例如,研究者通过融合ceval-exam-zhtw的多学科知识,开发了面向专业领域的预训练微调策略,显著提升了模型在医疗、法律等垂直场景的准确性。同时,基于该数据集构建的层次化评估体系催生了如C-EVAL等综合性评测基准,推动了中文大模型能力评估标准的规范化发展,为后续的模型优化与产业落地提供了重要参照。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作