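"Only Experiment" Subject Resource Dataset
Source: National Dataset Management Service Platform (国家数据集管理服务平台). Updated 2026-04-13; indexed 2026-04-29.
Download link: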
https://www.ndsms.cn/dataRetrieval/datasetDetail/?id=fbea46c0b80e8c438def13edf5b29be4
Description:
The "Only Experiment" Subject Resource Dataset was jointly built by Henan Huirong Digital Technology Co., Ltd. and Henan Experimental Middle School and covers all junior and senior high school subjects. It spans the core courses (Chinese, Mathematics, English, Physics, Chemistry, Biology, History, Geography, and Politics), and its scale continues to grow. The dataset integrates authoritative question banks, past examination papers, supplementary teaching materials, and teacher-written test papers. After OCR parsing, knowledge-point annotation, difficulty grading, and denoising, it comprises 3 million multimodal samples spanning structured data, text, and images. It has been used to fine-tune the "Only Experiment" large language model, which the provider reports significantly improves accuracy on educational question answering and problem solving. The provider plans to open it to the Henan education system to support intelligent teaching, precise assessment, and personalized learning.
Provider:
Henan Huirong Digital Technology Co., Ltd.
Created:
2026-04-10
Collected summary
Dataset introduction

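Background and challenges
Background overview
The dataset was built jointly by Henan Huirong Digital Technology Co., Ltd. and Henan Experimental Middle School. It covers the core courses of junior and senior high school across multiple subjects and integrates several sources: question banks, past examination papers, supplementary teaching materials, and teacher-written test papers. After parsing, annotation, and cleaning, it comprises 3 million multimodal samples. It has been used to fine-tune a large language model for educational applications, and the provider plans to open it to the Henan education system to support intelligent teaching and personalized learning.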
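The description mentions OCR parsing, knowledge-point annotation, difficulty grading, and denoising, but the listing does not publish a schema. The sketch below shows what a single sample and a simple cleaning pass might look like. Every name in it (`Sample`, its fields, the 1 to 5 difficulty scale, the `min_chars` threshold) is an assumption made for illustration, not the dataset's actual format or the provider's pipeline.

```python
import re
from dataclasses import dataclass, field
from typing import List, Optional, Set, Tuple


@dataclass
class Sample:
    """Hypothetical record layout; not the dataset's published schema."""
    subject: str                      # e.g. "Mathematics"
    stage: str                        # "junior" or "senior" (assumed labels)
    text: str                         # question text, e.g. from OCR
    knowledge_points: List[str] = field(default_factory=list)
    difficulty: Optional[int] = None  # assumed scale: 1 (easy) to 5 (hard)
    image_path: Optional[str] = None  # set when the sample includes a figure


def normalize(text: str) -> str:
    """Collapse runs of whitespace, which OCR output often contains."""
    return re.sub(r"\s+", " ", text).strip()


def clean(samples: List[Sample], min_chars: int = 5) -> List[Sample]:
    """Illustrative denoising: normalize text, drop short fragments and exact duplicates."""
    seen: Set[Tuple[str, str]] = set()
    kept: List[Sample] = []
    for s in samples:
        text = normalize(s.text)
        key = (s.subject, text)
        if len(text) < min_chars or key in seen:
            continue
        seen.add(key)
        kept.append(
            Sample(s.subject, s.stage, text,
                   list(s.knowledge_points), s.difficulty, s.image_path)
        )
    return kept


if __name__ == "__main__":
    raw = [
        Sample("Mathematics", "junior", "Solve  x + 2 = 5 .", ["linear equations"], 1),
        Sample("Mathematics", "junior", "Solve x + 2 = 5 .", ["linear equations"], 1),
        Sample("Physics", "senior", "??"),
    ]
    print(len(clean(raw)))  # 1: one duplicate and one fragment are dropped
```

Deduplicating on normalized text per subject is only one possible notion of "noise"; a real pipeline would likely also need near-duplicate detection and checks on OCR quality.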
The background overview above was collected and summarized by 遇见数据集.



