OpenCitations Index CSV dataset of all the citation data
Source: DataCite Commons | Updated: 2026-03-03 | Indexed: 2024-08-18
Download link:
https://figshare.com/articles/dataset/OpenCitations_Index_CSV_dataset_of_all_the_citation_data/24356626
Description:
This dataset contains all the citation data (in CSV format) included in the OpenCitations Index (https://opencitations.net/index), released on March 3, 2026. Each line of the CSV file defines a citation and includes the following information:
<b>[field "oci"]</b> the Open Citation Identifier (OCI) for the citation;
<b>[field "citing"]</b> the OMID of the citing entity;
<b>[field "cited"]</b> the OMID of the cited entity;
<b>[field "creation"]</b> the creation date of the citation (i.e. the publication date of the citing entity);
<b>[field "timespan"]</b> the time span of the citation (i.e. the interval between the publication date of the cited entity and the publication date of the citing entity);
<b>[field "journal_sc"]</b> whether the citation is a journal self-citation (i.e. the citing and the cited entities are published in the same journal);
<b>[field "author_sc"]</b> whether the citation is an author self-citation (i.e. the citing and the cited entities have at least one author in common).
<b>Note:</b> the information for each citation is sourced from OpenCitations Meta (https://opencitations.net/meta), a database that stores and delivers bibliographic metadata for all bibliographic resources included in the OpenCitations Index. The data provided in this dump is therefore based on the state of OpenCitations Meta at the time this collection was generated.
This version of the dataset contains 2,422,432,262 citations. The size of the zipped archive is 33 GB, while the size of the unzipped CSV file is around 200 GB.
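Because the unzipped CSV is around 200 GB, it is usually read row by row, not loaded whole. The following is a minimal Python sketch of that approach using the seven field names listed in the description. It makes assumptions the description does not confirm: that the file has a header row with those names, and that the self-citation flags are stored as "yes"/"no". The function name `summarize_citations`, the `truthy` parameter, and the sample rows are hypothetical and only for illustration.

```python
import csv
import io
from collections import Counter


def summarize_citations(lines, truthy=("yes",)):
    """Stream citation rows and count self-citations.

    `lines` is any iterable of text lines (an open file, a StringIO, ...).
    `truthy` lists the flag values treated as "is a self-citation";
    the default ("yes") is an assumption about the dump's encoding.
    """
    counts = Counter()
    # DictReader consumes one row at a time, so memory use stays flat.
    for row in csv.DictReader(lines):
        counts["total"] += 1
        if row["journal_sc"].strip().lower() in truthy:
            counts["journal_sc"] += 1
        if row["author_sc"].strip().lower() in truthy:
            counts["author_sc"] += 1
    return counts


# Tiny in-memory sample with made-up identifiers, standing in for the real file.
sample = io.StringIO(
    "oci,citing,cited,creation,timespan,journal_sc,author_sc\n"
    "oci-1,omid-a,omid-b,2020-01-15,P1Y,no,yes\n"
    "oci-2,omid-c,omid-b,2021-06-01,P2Y5M,yes,no\n"
    "oci-3,omid-d,omid-a,2022-03-10,P2Y,no,no\n"
)
print(summarize_citations(sample))
```

For the actual dump, the same function could be given a file handle opened with `open(path, newline="", encoding="utf-8")`, where `path` points to wherever the archive was extracted. It is worth inspecting the first few rows first to check the header and the flag values against the assumptions above.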
Provider:
figshare
Created:
2023-10-23



