five

Input JSON data for the pipeline of the CLARA Knowledge Graph

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/8107150
下载链接
链接失效反馈
官方服务:
资源简介:
CLARA This deposit is part of the CLARA project. The CLARA project aims to empower teachers in the task of creating new educational resources. And in particular with the task of handling the licenses of reused educational resources. The present deposit contains the JSON files extracted from the X5GON Postgresql database. The files are fed to the pipeline of the CLARA project for the creation of 4 different RDF graphs. This is achieved through the use of RDF mappings (RML, RML-star). That pipeline can be found on Gitlab. The results of this pipeline can also be found on Zenodo, on those four different deposits: Standard reification Singleton properties Named graphs RDF-star   Content The JSON files contain information on a total of 45K educational resources, linked to a total of 135K subjects (extracted from DBpedia). Each educational resource is linked to the subjects it talks about. Each of those links has two corresponding scores which represent the certainty of the given link. Those scores are "norm_cosine" and "norm_pageRank". The dataset was cut into multiple JSON files in order to make its processing easier.  There are two type of json files in this deposit: authors_[X].json - Which lists the authors names ER_[X].json - Which lists the educational resources and their related information. That information contains: their title. their description. their language (and language_detected, only the first one is used in the pipeline here). their license. their mimetype. the authors. the date of creation of the resource. a url linking to the resource itself. and finally the subjects (named concepts) associated to the resource. With the corresponding scores.
创建时间:
2023-10-04
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作