130 Million Chinese Test Questions from Primary School to University: Structured and Parsed Text Data
Datatang, indexed 2025-03-08
Download link:
https://www.datatang.com/dataset/1448
Resource description:
This dataset contains 130 million Chinese test questions in text form, ranging from primary school to university and vocational level. The K12 portion totals 20.87 million questions, of which 16 million include explanations; the university and vocational portion totals 117 million questions, of which 7 million include explanations.
K12 questions carry fields such as data quality level, question type, education stage, difficulty, grade, subject, answer and explanation. The education stages are primary school, junior high school and senior high school, and the subjects are Chinese, Mathematics, English, History, Geography, Politics, Biology, Physics, Chemistry and Science.
University and vocational questions carry fields such as answer, explanation and category. Their domains include public security, civil service examinations, medicine, foreign languages, academic credentials, engineering, education, law, economics, vocational skills, computer science, qualification certification and finance.
Question types include single-answer and multiple-answer choice questions, true/false questions, fill-in-the-blank questions and others. The dataset can be used for subject-knowledge enhancement of large language models (LLMs).
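The K12 fields listed in the description can be pictured as a simple record type. The following is a minimal sketch only: the listing names the field concepts but not the actual column names, file format or value encodings, so every identifier here (`K12Question`, `question`, `quality_level`, `with_explanations`, and so on) and the sample values are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class K12Question:
    # All field names are hypothetical; the listing only names the concepts.
    question: str                      # assumed: the question text itself
    quality_level: str                 # data quality level
    question_type: str                 # e.g. single-choice, true/false
    stage: str                         # primary, junior high or senior high
    difficulty: str                    # question difficulty
    grade: str
    subject: str
    answer: str
    explanation: Optional[str] = None  # only some questions have one


def with_explanations(records: List[K12Question]) -> List[K12Question]:
    """Keep only the records that carry a non-empty explanation."""
    return [r for r in records if r.explanation]


# Made-up sample records, for illustration only.
sample = [
    K12Question("1 + 1 = ?", "A", "fill-in-the-blank", "primary",
                "easy", "1", "Mathematics", "2", "Add one and one."),
    K12Question("2 + 2 = ?", "A", "fill-in-the-blank", "primary",
                "easy", "1", "Mathematics", "4"),
]
explained = with_explanations(sample)
```

A filter like this mirrors the split the description draws between questions with and without explanations, which matters when only explained items are wanted for model training.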
Provider:
Datatang (数据堂)

The content above was collected and summarized by 遇见数据集.



