five

OpenCitations Index N-Triples dataset of all the citation data

收藏
figshare.com2024-07-01 更新2025-03-22 收录
下载链接:
https://figshare.com/articles/dataset/OpenCitations_Index_N-Triples_dataset_of_all_the_citation_data/24369136/3
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains all the citation data (in N-Triples format) included in the OpenCitations Index, released on July 1, 2024. In particular, any citation in the dataset, defined as an individual of the class cito:Citation, includes the following information:[citation IRI] the Open Citation Identifier (OCI) for the citation, defined in the final part of the URL identifying the citation (https://w3id.org/oc/index/ci/[OCI]);[property "cito:hasCitingEntity"] the citing entity identified by its OMID URL (https://https://opencitations.net/meta/[OMID]);[property "cito:hasCitedEntity"] the cited entity identified by its OMID URL (https://https://opencitations.net/meta/[OMID]);[property "cito:hasCitationCreationDate"] the creation date of the citation (i.e. the publication date of the citing entity);[property "cito:hasCitationTimeSpan"] the time span of the citation (i.e. the interval between the publication date of the cited entity and the publication date of the citing entity);[type "cito:JournalSelfCitation"] it records whether the citation is a journal self-citations (i.e. the citing and the cited entities are published in the same journal);[type "cito:AuthorSelfCitation"] it records whether the citation is an author self-citation (i.e. the citing and the cited entities have at least one author in common).Note: the information for each citation is sourced from OpenCitations Meta (https://opencitations.net/meta), a database that stores and delivers bibliographic metadata for all bibliographic resources included in the OpenCitations Indexes. The data provided in this dump is therefore based on the state of OpenCitations Meta at the time this collection was generated.This version of the dataset contains:2,012,939,079 citationsThe size of the zipped archive is 65.6 GB, while the size of the unzipped N-Triples files is 1.5 TB.

本数据集囊括了OpenCitations Index(开放引用索引)中所有引用数据(以N-Triples格式存储),并于2024年7月1日公开发布。具体而言,数据集中的每一项引用,即属于cito:Citation类别的个体,均包含以下信息:[引用IRI] 开放引用标识符(OCI),由URL的末尾部分定义,以识别引用(https://w3id.org/oc/index/ci/[OCI]);[属性“cito:hasCitingEntity”] 通过其OMID URL(https://opencitations.net/meta/[OMID])识别的引用实体;[属性“cito:hasCitedEntity”] 通过其OMID URL(https://opencitations.net/meta/[OMID])识别的被引用实体;[属性“cito:hasCitationCreationDate”] 引用创建日期(即引用实体的出版日期);[属性“cito:hasCitationTimeSpan”] 引用时间跨度(即被引用实体与引用实体的出版日期之间的间隔);[类型“cito:JournalSelfCitation”] 记录该引用是否为期刊自引(即引用实体与被引用实体发表在同一期刊上);[类型“cito:AuthorSelfCitation”] 记录该引用是否为作者自引(即引用实体与被引用实体至少有一位共同作者)。注:每项引用的信息均来源于OpenCitations Meta(https://opencitations.net/meta),该数据库存储并提供了OpenCitations Indexes中包含的所有文献资源的书目元数据。因此,本数据集中提供的数据基于生成该集合时的OpenCitations Meta状态。本版本的数据集包含:201,293,907,9条引用,压缩归档文件大小为65.6 GB,解压后的N-Triples文件大小为1.5 TB。
提供机构:
figshare
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作