OpenCitations Index N-Triples dataset of the provenance information of all the citation data
收藏figshare.com2024-07-01 更新2025-03-24 收录
下载链接:
https://figshare.com/articles/dataset/OpenCitations_Index_N-Triples_dataset_of_the_provenance_information_of_all_the_citation_data/24417736/2
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains the provenance information (in N-Triples format) of all the citation data included in the OpenCitation Index, released on 29 November 2023. In particular, any citation in the dataset includes the following provenance information:[citation IRI] the Open Citation Identifier (OCI) for the citation, defined in the final part of the URL identifying the citation (https://w3id.org/oc/index/ci/[OCI]);[property "prov:wasAttributedTo"] the IRI of the agent that has created the citation data;[property "prov:hadPrimarySource"] the IRI of the source dataset from where the citation data have been extracted;[property "prov:generatedAtTime"] the creation time of the citation data.[propert "prov:invalidatedAtTime"] the start of the destruction, cessation, or expiry of an existing entity by an activity.[property "oco:hasUpdateQuery"] the UPDATE SPARQL query that keeps track of which metadata have been modified.The size of the zipped archive is 79 GB, while the size of the unzipped N-Triples files is 2.5 TB.
本数据集收录了截至2023年11月29日发布的OpenCitation Index中所有引用数据的来源信息(采用N-Triples格式)。具体而言,数据集中的每一项引用均包含以下来源信息:[引用标识符 IRI] Open Citation标识符(OCI),该标识符定义于识别引用的URL的最后一部分(https://w3id.org/oc/index/ci/[OCI]);[属性“prov:wasAttributedTo”] 创建引用数据的实体的IRI;[属性“prov:hadPrimarySource”] 提取引用数据的原始数据集的IRI;[属性“prov:generatedAtTime”] 引用数据的创建时间;[属性“prov:invalidatedAtTime”] 通过活动开始对现有实体进行破坏、终止或失效的时间。[属性“oco:hasUpdateQuery”] 用于跟踪哪些元数据已被修改的UPDATE SPARQL查询。压缩归档文件的大小为79 GB,而展开的N-Triples文件大小为2.5 TB。
提供机构:
figshare.com



