China Mobile Education Industry Image-Text Dataset
Source: 国家数据集管理服务平台 | Updated: 2026-05-28 | Indexed: 2026-04-29
Download link:
https://www.ndsms.cn/dataRetrieval/datasetDetail/?id=1c9946383982bb6c2ff0418f8dd7e915
Description:
This dataset is a multi-subject collection of exam questions in image-text form, each with a complete answer explanation. It covers all school stages (primary, junior high, and senior high) and 10 core subjects: Chinese, Mathematics, Science, English, Geography, Biology, Physics, Chemistry, History, and Politics. Question types include open-ended, multiple-choice, single-choice, true/false, and fill-in-the-blank questions, among other mainstream types. Each sample pairs the question image with structured text content. The dataset is education-specific, covers school stages and subjects broadly, and adapts to a range of tasks, which makes it suitable for training large models for the education field, building intelligent question banks, automatic answering and explanation generation, question classification and management, training intelligent grading systems, and compliance review of educational content.
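The listing does not document the on-disk record format, so the following is only a minimal sketch of how the taxonomy described above could be modeled when organizing samples. The `Sample` class, all of its field names, and the `select` helper are hypothetical and chosen for illustration; only the stage, subject, and question-type values come from the description, and the question-type list there is not exhaustive.

```python
from dataclasses import dataclass

# Taxonomy taken from the dataset description.
STAGES = ("primary", "junior_high", "senior_high")
SUBJECTS = (
    "Chinese", "Mathematics", "Science", "English", "Geography",
    "Biology", "Physics", "Chemistry", "History", "Politics",
)
# The description lists these "among others", so this tuple may be incomplete.
QUESTION_TYPES = (
    "open_ended", "multiple_choice", "single_choice",
    "true_false", "fill_in_the_blank",
)


@dataclass(frozen=True)
class Sample:
    """Hypothetical record layout; the real field names are an assumption."""
    image_path: str      # location of the question image
    stage: str           # one of STAGES
    subject: str         # one of SUBJECTS
    question_type: str   # one of QUESTION_TYPES
    question: str        # structured question text
    answer: str          # reference answer
    explanation: str     # answer explanation

    def __post_init__(self):
        # Reject labels that fall outside the documented taxonomy.
        if self.stage not in STAGES:
            raise ValueError(f"unknown stage: {self.stage!r}")
        if self.subject not in SUBJECTS:
            raise ValueError(f"unknown subject: {self.subject!r}")
        if self.question_type not in QUESTION_TYPES:
            raise ValueError(f"unknown question type: {self.question_type!r}")


def select(samples, *, stage=None, subject=None, question_type=None):
    """Return the samples that match every filter that is not None."""
    return [
        s for s in samples
        if (stage is None or s.stage == stage)
        and (subject is None or s.subject == subject)
        and (question_type is None or s.question_type == question_type)
    ]
```

A call such as `select(samples, stage="junior_high", subject="Physics")` would then return only junior high physics questions, which is the kind of slicing that question-bank construction or per-subject model training would need.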
Provider:
中移九天人工智能科技(北京)有限公司
Created:
2026-04-25
Collected Summary
Dataset Introduction

Background and Challenges
Background Overview
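This dataset is a multi-subject collection of exam questions in image-text form with complete answer explanations. It spans primary through senior high school, covers 10 core subjects such as Chinese and Mathematics along with a variety of mainstream question types, and each sample combines an image with structured text. The dataset is 87.5 GB in size and is suited to applications such as training large models for the education field, building intelligent question banks, and automatic answering and explanation generation.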
The above content was collected and summarized by 遇见数据集.



