EDUKG
arXiv · Indexed 2025-09-30
Download link:
https://github.com/THU-KEG/EDUKG
Resource overview:
EDUKG is a heterogeneous, sustainable K-12 educational knowledge graph covering interdisciplinary knowledge topics and the relationships among them. It extracts knowledge from textbooks, with the aim of improving the adequacy and sustainability of educational knowledge. EDUKG comprises 256 units, 779 lessons, and 2,371 segments drawn from 46 textbooks, together with 6,518 practice exercises that break down into 10,602 individual questions. The dataset also includes image data, averaging 3.11 images per textbook segment. In total it contains over 252 million entities and 3.86 billion triples. It can be applied to tasks such as learning management system development, intelligent tutoring system research, and educational data mining.
Provided by:
THU-KEG
Collected summary
Dataset introduction

Background and challenges
Background overview
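EDUKG is a large-scale K-12 educational knowledge graph maintained by the Knowledge Engineering Group at Tsinghua University (THU-KEG). It comprises 38 knowledge graphs, 252 million entities, and 3.86 billion triples. It features an interdisciplinary, fine-grained ontology design, provides rich educational resources and external heterogeneous data, and supports dynamic updates and maintenance.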
The content above was collected and summarized by 遇见数据集.



