Input JSON data for the pipeline of the CLARA Knowledge Graph
Source: NIAID Data Ecosystem (indexed 2026-05-01)
Download link:
https://zenodo.org/record/8107150
Resource description:
CLARA
This deposit is part of the CLARA project. The CLARA project aims to empower teachers in the task of creating new educational resources, and in particular in handling the licenses of reused educational resources.
The present deposit contains the JSON files extracted from the X5GON PostgreSQL database. These files are fed to the CLARA pipeline, which uses RDF mappings (RML, RML-star) to create 4 different RDF graphs.
That pipeline can be found on GitLab.
The results of this pipeline can also be found on Zenodo, in four separate deposits, one per RDF representation:
Standard reification
Singleton properties
Named graphs
RDF-star
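The four representations listed above are four common ways of attaching metadata to an RDF statement. As a rough illustration only, the sketch below prints one scored link in each style. The prefixes ex: and dbr:, the predicate ex:about, the identifiers ex:er1, ex:about_1 and ex:g1, and the function name score_encodings are all hypothetical and are not taken from the CLARA mappings. The rdf:singletonPropertyOf predicate comes from the singleton-property proposal, not from the W3C RDF vocabulary.

```python
def score_encodings(resource, subject, score):
    """Return one scored link written in four RDF styles (Turtle/TriG-like text).

    All prefixes and identifiers are placeholders chosen for illustration.
    """
    link = f"{resource} ex:about {subject}"
    return {
        # Standard reification: a node of type rdf:Statement describes the triple.
        "standard_reification": (
            "_:st a rdf:Statement ;\n"
            f"    rdf:subject {resource} ;\n"
            "    rdf:predicate ex:about ;\n"
            f"    rdf:object {subject} ;\n"
            f"    ex:norm_cosine {score} ."
        ),
        # Singleton property: a predicate used only once carries the metadata.
        "singleton_property": (
            f"{resource} ex:about_1 {subject} .\n"
            "ex:about_1 rdf:singletonPropertyOf ex:about ;\n"
            f"    ex:norm_cosine {score} ."
        ),
        # Named graph: the triple sits in its own graph, which is then described.
        "named_graph": (
            f"ex:g1 {{ {link} . }}\n"
            f"ex:g1 ex:norm_cosine {score} ."
        ),
        # RDF-star: the quoted triple is itself the subject of the score.
        "rdf_star": f"<< {link} >> ex:norm_cosine {score} .",
    }


for style, text in score_encodings("ex:er1", "dbr:Physics", 0.8).items():
    print(f"# {style}\n{text}\n")
```

The actual graphs in the four deposits may use different vocabularies and identifiers; the sketch only shows how the same link and score change shape from one style to the next.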
Content
The JSON files contain information on a total of 45K educational resources, linked to a total of 135K subjects (extracted from DBpedia). Each educational resource is linked to the subjects it covers. Each of those links carries two scores, "norm_cosine" and "norm_pageRank", which represent the certainty of the link.
The dataset was split into multiple JSON files to make it easier to process.
There are two types of JSON files in this deposit:
authors_[X].json, which lists the authors' names.
ER_[X].json, which lists the educational resources and their related information.
For each resource, that information includes:
its title.
its description.
its language (a language_detected field is also present, but only language is used in this pipeline).
its license.
its mimetype.
its authors.
the date of creation of the resource.
a URL linking to the resource itself.
and finally the subjects (named concepts) associated with the resource, with the corresponding scores.
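As a minimal sketch of how one ER_[X].json file might be read, the Python below loads a file and keeps the subjects whose norm_cosine score reaches a threshold. The exact JSON layout is not described here, so everything except the score name norm_cosine is an assumption: the "subjects" and "name" keys, the idea that each file holds a list of records, the 0.5 threshold, and the helper names load_records and strong_subjects.

```python
import json
from pathlib import Path


def load_records(path):
    """Parse one JSON file and always return a list of records."""
    with Path(path).open(encoding="utf-8") as fh:
        data = json.load(fh)
    # Assumption: a file holds either a list of records or a single record.
    return data if isinstance(data, list) else [data]


def strong_subjects(record, min_cosine=0.5, subjects_key="subjects", name_key="name"):
    """Return names of subjects whose norm_cosine is at least min_cosine.

    subjects_key and name_key are hypothetical; adjust them to the real files.
    """
    kept = []
    for subject in record.get(subjects_key, []):
        # Scores may be stored as numbers or strings, so convert defensively.
        if float(subject.get("norm_cosine", 0.0)) >= min_cosine:
            kept.append(subject.get(name_key))
    return kept


# Tiny in-memory record shaped like the assumed layout.
sample = {
    "title": "Intro to optics",
    "subjects": [
        {"name": "Optics", "norm_cosine": 0.9, "norm_pageRank": 0.4},
        {"name": "History", "norm_cosine": 0.1, "norm_pageRank": 0.2},
    ],
}
print(strong_subjects(sample))
```

With real data, load_records would be called once per ER_[X].json file and strong_subjects applied to each record, after the key names have been checked against an actual file.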
Created:
2023-10-04



