ZoneTwelve/tmmluplus
收藏TMMLU+ 数据集概述
数据集基本信息
- 许可证: Creative Commons BY-NC
- 任务类别: 问答
- 语言: 中文
- 标签: 繁体中文、金融、医学、台湾、基准测试、zh-tw、zh-hant
- 名称: tmmlu++
- 大小类别: 100K<n<1M
数据集配置
TMMLU+ 数据集包含多个子任务,每个子任务都有训练、验证和测试集。以下是部分子任务及其对应的数据文件路径:
-
engineering_math
- 训练集:
data/engineering_math_dev.csv - 验证集:
data/engineering_math_val.csv - 测试集:
data/engineering_math_test.csv
- 训练集:
-
dentistry
- 训练集:
data/dentistry_dev.csv - 验证集:
data/dentistry_val.csv - 测试集:
data/dentistry_test.csv
- 训练集:
-
traditional_chinese_medicine_clinical_medicine
- 训练集:
data/traditional_chinese_medicine_clinical_medicine_dev.csv - 验证集:
data/traditional_chinese_medicine_clinical_medicine_val.csv - 测试集:
data/traditional_chinese_medicine_clinical_medicine_test.csv
- 训练集:
-
clinical_psychology
- 训练集:
data/clinical_psychology_dev.csv - 验证集:
data/clinical_psychology_val.csv - 测试集:
data/clinical_psychology_test.csv
- 训练集:
-
technical
- 训练集:
data/technical_dev.csv - 验证集:
data/technical_val.csv - 测试集:
data/technical_test.csv
- 训练集:
-
culinary_skills
- 训练集:
data/culinary_skills_dev.csv - 验证集:
data/culinary_skills_val.csv - 测试集:
data/culinary_skills_test.csv
- 训练集:
-
mechanical
- 训练集:
data/mechanical_dev.csv - 验证集:
data/mechanical_val.csv - 测试集:
data/mechanical_test.csv
- 训练集:
-
logic_reasoning
- 训练集:
data/logic_reasoning_dev.csv - 验证集:
data/logic_reasoning_val.csv - 测试集:
data/logic_reasoning_test.csv
- 训练集:
-
real_estate
- 训练集:
data/real_estate_dev.csv - 验证集:
data/real_estate_val.csv - 测试集:
data/real_estate_test.csv
- 训练集:
-
general_principles_of_law
- 训练集:
data/general_principles_of_law_dev.csv - 验证集:
data/general_principles_of_law_val.csv - 测试集:
data/general_principles_of_law_test.csv
- 训练集:
-
finance_banking
- 训练集:
data/finance_banking_dev.csv - 验证集:
data/finance_banking_val.csv - 测试集:
data/finance_banking_test.csv
- 训练集:
-
anti_money_laundering
- 训练集:
data/anti_money_laundering_dev.csv - 验证集:
data/anti_money_laundering_val.csv - 测试集:
data/anti_money_laundering_test.csv
- 训练集:
-
ttqav2
- 训练集:
data/ttqav2_dev.csv - 验证集:
data/ttqav2_val.csv - 测试集:
data/ttqav2_test.csv
- 训练集:
-
marketing_management
- 训练集:
data/marketing_management_dev.csv - 验证集:
data/marketing_management_val.csv - 测试集:
data/marketing_management_test.csv
- 训练集:
-
business_management
- 训练集:
data/business_management_dev.csv - 验证集:
data/business_management_val.csv - 测试集:
data/business_management_test.csv
- 训练集:
-
organic_chemistry
- 训练集:
data/organic_chemistry_dev.csv - 验证集:
data/organic_chemistry_val.csv - 测试集:
data/organic_chemistry_test.csv
- 训练集:
-
advance_chemistry
- 训练集:
data/advance_chemistry_dev.csv - 验证集:
data/advance_chemistry_val.csv - 测试集:
data/advance_chemistry_test.csv
- 训练集:
-
physics
- 训练集:
data/physics_dev.csv - 验证集:
data/physics_val.csv - 测试集:
data/physics_test.csv
- 训练集:
-
secondary_physics
- 训练集:
data/secondary_physics_dev.csv - 验证集:
data/secondary_physics_val.csv - 测试集:
data/secondary_physics_test.csv
- 训练集:
-
human_behavior
- 训练集:
data/human_behavior_dev.csv - 验证集:
data/human_behavior_val.csv - 测试集:
data/human_behavior_test.csv
- 训练集:
-
national_protection
- 训练集:
data/national_protection_dev.csv - 验证集:
data/national_protection_val.csv - 测试集:
data/national_protection_test.csv
- 训练集:
-
jce_humanities
- 训练集:
data/jce_humanities_dev.csv - 验证集:
data/jce_humanities_val.csv - 测试集:
data/jce_humanities_test.csv
- 训练集:
-
politic_science
- 训练集:
data/politic_science_dev.csv - 验证集:
data/politic_science_val.csv - 测试集:
data/politic_science_test.csv
- 训练集:
-
agriculture
- 训练集:
data/agriculture_dev.csv - 验证集:
data/agriculture_val.csv - 测试集:
data/agriculture_test.csv
- 训练集:
-
official_document_management
- 训练集:
data/official_document_management_dev.csv - 验证集:
data/official_document_management_val.csv - 测试集:
data/official_document_management_test.csv
- 训练集:
-
financial_analysis
- 训练集:
data/financial_analysis_dev.csv - 验证集:
data/financial_analysis_val.csv - 测试集:
data/financial_analysis_test.csv
- 训练集:
-
pharmacy
- 训练集:
data/pharmacy_dev.csv - 验证集:
data/pharmacy_val.csv - 测试集:
data/pharmacy_test.csv
- 训练集:
-
educational_psychology
- 训练集:
data/educational_psychology_dev.csv - 验证集:
data/educational_psychology_val.csv - 测试集:
data/educational_psychology_test.csv
- 训练集:
-
statistics_and_machine_learning
- 训练集:
data/statistics_and_machine_learning_dev.csv - 验证集:
data/statistics_and_machine_learning_val.csv - 测试集:
data/statistics_and_machine_learning_test.csv
- 训练集:
-
management_accounting
- 训练集:
data/management_accounting_dev.csv - 验证集:
data/management_accounting_val.csv - 测试集:
data/management_accounting_test.csv
- 训练集:
-
introduction_to_law
- 训练集:
data/introduction_to_law_dev.csv - 验证集:
data/introduction_to_law_val.csv - 测试集:
data/introduction_to_law_test.csv
- 训练集:
-
computer_science
- 训练集:
data/computer_science_dev.csv - 验证集:
data/computer_science_val.csv - 测试集:
data/computer_science_test.csv
- 训练集:
-
veterinary_pathology
- 训练集:
data/veterinary_pathology_dev.csv - 验证集:
data/veterinary_pathology_val.csv - 测试集:
data/veterinary_pathology_test.csv
- 训练集:
-
accounting
- 训练集:
data/accounting_dev.csv - 验证集:
data/accounting_val.csv - 测试集:
data/accounting_test.csv
- 训练集:
-
fire_science
- 训练集:
data/fire_science_dev.csv - 验证集:
data/fire_science_val.csv - 测试集:
data/fire_science_test.csv
- 训练集:
-
optometry
- 训练集:
data/optometry_dev.csv - 验证集:
data/optometry_val.csv - 测试集:
data/optometry_test.csv
- 训练集:
-
insurance_studies
- 训练集:
data/insurance_studies_dev.csv - 验证集:
data/insurance_studies_val.csv - 测试集:
data/insurance_studies_test.csv
- 训练集:
-
pharmacology
- 训练集:
data/pharmacology_dev.csv - 验证集:
data/pharmacology_val.csv - 测试集:
data/pharmacology_test.csv
- 训练集:
-
taxation
- 训练集:
data/taxation_dev.csv - 验证集:
data/taxation_val.csv - 测试集:
data/taxation_test.csv
- 训练集:
-
trust_practice
- 训练集:
data/trust_practice_dev.csv - 验证集:
data/trust_practice_val.csv - 测试集:
data/trust_practice_test.csv
- 训练集:
-
geography_of_taiwan
- 训练集:
data/geography_of_taiwan_dev.csv - 验证集:
data/geography_of_taiwan_val.csv - 测试集:
data/geography_of_taiwan_test.csv
- 训练集:
-
physical_education
- 训练集:
data/physical_education_dev.csv - 验证集:
data/physical_education_val.csv - 测试集:
data/physical_education_test.csv
- 训练集:
-
auditing
- 训练集:
data/auditing_dev.csv - 验证集:
data/auditing_val.csv - 测试集:
data/auditing_test.csv
- 训练集:
-
administrative_law
- 训练集:
data/administrative_law_dev.csv - 验证集:
data/administrative_law_val.csv - 测试集:
data/administrative_law_test.csv
- 训练集:
-
education_(profession_level)
- 训练集:
data/education_(profession_level)_dev.csv - 验证集:
data/education_(profession_level)_val.csv - 测试集:
data/education_(profession_level)_test.csv
- 训练集:
-
economics
- 训练集:
data/economics_dev.csv - 验证集:
data/economics_val.csv - 测试集:
data/economics_test.csv
- 训练集:
-
veterinary_pharmacology
- 训练集:
data/veterinary_pharmacology_dev.csv - 验证集:
data/veterinary_pharmacology_val.csv - 测试集:
data/veterinary_pharmacology_test.csv
- 训练集:
-
nautical_science
- 训练集:
data/nautical_science_dev.csv - 验证集:
data/nautical_science_val.csv - 测试集:
data/nautical_science_test.csv
- 训练集:
-
occupational_therapy_for_psychological_disorders
- 训练集:
data/occupational_therapy_for_psychological_disorders_dev.csv - 验证集:
data/occupational_therapy_for_psychological_disorders_val.csv - 测试集:
data/occupational_therapy_for_psychological_disorders_test.csv
- 训练集:
-
basic_medical_science
- 训练集:
data/basic_medical_science_dev.csv - 验证集:
data/basic_medical_science_val.csv - 测试集:
data/basic_medical_science_test.csv
- 训练集:
-
macroeconomics
- 训练集:
data/macroeconomics_dev.csv - 验证集:
data/macroeconomics_val.csv - 测试集:
data/macroeconomics_test.csv
- 训练集:
-
trade
- 训练集:
data/trade_dev.csv - 验证集:
data/trade_val.csv - 测试集:
data/trade_test.csv
- 训练集:
-
chinese_language_and_literature
- 训练集:
data/chinese_language_and_literature_dev.csv - 验证集:
data/chinese_language_and_literature_val.csv - 测试集:
data/chinese_language_and_literature_test.csv
- 训练集:
-
tve_design
- 训练集:
data/tve_design_dev.csv - 验证集:
data/tve_design_val.csv - 测试集:
data/tve_design_test.csv
- 训练集:
-
junior_science_exam
- 训练集:
data/junior_science_exam_dev.csv - 验证集:
data/junior_science_exam_val.csv - 测试集:
data/junior_science_exam_test.csv
- 训练集:
-
junior_math_exam
- 训练集:
data/junior_math_exam_dev.csv - 验证集:
data/junior_math_exam_val.csv - 测试集:
data/junior_math_exam_test.csv
- 训练集:
-
junior_chinese_exam
- 训练集:
data/junior_chinese_exam_dev.csv - 验证集:
data/junior_chinese_exam_val.csv - 测试集:
data/junior_chinese_exam_test.csv
- 训练集:
-
junior_social_studies
- 训练集:
data/junior_social_studies_dev.csv - 验证集:
data/junior_social_studies_val.csv - 测试集:
data/junior_social_studies_test.csv
- 训练集:
-
tve_mathematics
- 训练集:
data/tve_mathematics_dev.csv - 验证集:
data/tve_mathematics_val.csv - 测试集:
data/tve_mathematics_test.csv
- 训练集:
-
tve_chinese_language



