CollabRec: DBpedia Subgraphs (2022-09)
收藏NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://zenodo.org/record/7772595
下载链接
链接失效反馈官方服务:
资源简介:
The core version of DBpedia has too many entities and statements to train recommendation models in a reasonable time frame, which is why we created two subsets (DB1M, and DBA240) of the core version of DBpedia from September 2022.
File structure
Each dataset is located in their own folder with the following files:
index.tsv.gz is a file in tabular format that maps a simple integer to a URI, which identifies an entity in the KG.
index_labels.tsv.gz is a file that links entities (represented by their index number) to their label and description.
relevant_entities.tsv.gz is a file with all the entities, which occur as subject or/and as object in statements of the subsampled KG.
statements.tsv.gz is a file with all the statements of the subsampled KG. The first column contains the subjects, second column the predicates, and the third column the objects. All those entities are represented by their index number (see index.tsv.gz) and not their URI.
statements.nt.gz is a file with all the statements of the subsampled KG in N-Triples format.
创建时间:
2023-03-28



