ORCID Public Data File 2020
Source: Figshare. Updated 2020-10-13; indexed 2026-04-28.
Download link:
https://figshare.com/articles/dataset/ORCID_Public_Data_File_2020/13066970
Description:
These files contain a snapshot of all public data in the ORCID Registry associated with an ORCID record that was created or claimed by an individual as of October 1st, 2020. ORCID publishes this file once per year under a Creative Commons CC0 1.0 Universal public domain dedication. This means that, to the extent possible under law, ORCID has waived all copyright and related or neighbouring rights to the Public Data File. For more information on the file, see https://orcid.org/content/orcid-public-data-file-use-policy

The file contains the public information associated with each user's ORCID record. The data is available in XML format and is divided into separate files for easier management. One file contains the full record summary for each record. The rest of the data is divided into 11 files, which contain the activities for each record, including full work data.

Below is a more complete description of how the data is structured.

### Summaries file

Name: `ORCID_2020_10_summaries.tar.gz`

Description: Contains all existing summaries. When extracted, it generates the following file structure: `summaries/[3 digits checksum]/[iD].xml`

Example: If you are looking for the summary of iD '0000-0002-7869-831X', decompress the file and you will find the summary under `summaries/31X/0000-0002-7869-831X.xml`.

### Activities files

Named:
- `ORCID_2020_10_activites_0.tar.gz`
- `ORCID_2020_10_activites_1.tar.gz`
- `ORCID_2020_10_activites_2.tar.gz`
- `ORCID_2020_10_activites_3.tar.gz`
- `ORCID_2020_10_activites_4.tar.gz`
- `ORCID_2020_10_activites_5.tar.gz`
- `ORCID_2020_10_activites_6.tar.gz`
- `ORCID_2020_10_activites_7.tar.gz`
- `ORCID_2020_10_activites_8.tar.gz`
- `ORCID_2020_10_activites_9.tar.gz`
- `ORCID_2020_10_activites_X.tar.gz`

Description: Consists of 11 `.tar.gz` files. Each file contains the public activities that belong to the iDs sharing a given checksum. The file hierarchy is as follows: `[checksum]/[3 digits checksum]/[iD]/[activity type]/[iD]_[activity_type]_[putcode].xml`

Examples:
- If you are looking for the public activities that belong to '0000-0002-7869-831X': decompress the file `ORCID_2020_10_activites_X.tar.gz`. You will find all the public activities under `X/31X/0000-0002-7869-831X/`, which is subdivided into one folder per activity type.
- If you are looking for all the employments that belong to '0000-0002-7869-831X': decompress the file `ORCID_2020_10_activites_X.tar.gz` and navigate to `X/31X/0000-0002-7869-831X/employments`.
- If you are looking for the employment with put-code '7923980' that belongs to '0000-0002-7869-831X': decompress the file `ORCID_2020_10_activites_X.tar.gz`. You will find that employment under `X/31X/0000-0002-7869-831X/employments/0000-0002-7869-831X_employments_7923980.xml`.

### Companion Resources

- https://github.com/ORCID/orcid-model/tree/master/src/main/resources/common_3.0
- 2019 File: https://doi.org/10.23640/07243.9988322.v2
- 2018 File: https://doi.org/10.23640/07243.7234028.v1
- 2017 File: https://doi.org/10.6084/m9.figshare.5479792.v1
- 2016 File: https://doi.org/10.6084/m9.figshare.4134027
- 2015 File: https://dx.doi.org/10.6084/m9.figshare.1582705
- 2014 File: http://dx.doi.org/10.14454/07243.2014.001
- 2013 File: http://dx.doi.org/10.14454/07243.2013.001
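The directory layouts described above can be turned into a few path helpers. The following is a minimal Python sketch, not an official ORCID tool. It assumes, based only on the examples in this description, that `[checksum]` is the last character of the iD and `[3 digits checksum]` is its last three characters. The function names (`summary_member`, `activities_archive`, `activity_member`, `read_member`) are hypothetical, and archive file names are spelled exactly as listed above.

```python
import tarfile

# Archive names as given in this description (spelling kept verbatim).
SUMMARIES_ARCHIVE = "ORCID_2020_10_summaries.tar.gz"
ACTIVITIES_ARCHIVE = "ORCID_2020_10_activites_{checksum}.tar.gz"


def _split_id(orcid_id):
    """Return (checksum, bucket) for an iD such as '0000-0002-7869-831X'.

    Assumption inferred from the examples: the checksum is the final
    character and the bucket is the final three characters of the iD.
    """
    if len(orcid_id) != 19:
        raise ValueError(f"unexpected iD format: {orcid_id!r}")
    return orcid_id[-1], orcid_id[-3:]


def summary_member(orcid_id):
    """Path of a record summary inside the summaries archive."""
    _, bucket = _split_id(orcid_id)
    return f"summaries/{bucket}/{orcid_id}.xml"


def activities_archive(orcid_id):
    """Name of the activities archive that should hold this iD."""
    checksum, _ = _split_id(orcid_id)
    return ACTIVITIES_ARCHIVE.format(checksum=checksum)


def activity_member(orcid_id, activity_type, put_code):
    """Path of a single activity file inside its activities archive."""
    checksum, bucket = _split_id(orcid_id)
    return (
        f"{checksum}/{bucket}/{orcid_id}/{activity_type}/"
        f"{orcid_id}_{activity_type}_{put_code}.xml"
    )


def read_member(archive_path, member_name):
    """Read one file from a downloaded .tar.gz archive and return its bytes.

    Assumes member names are stored exactly as in the layout above; if the
    archive uses a prefix such as './', the name would need adjusting.
    """
    with tarfile.open(archive_path, "r:gz") as archive:
        extracted = archive.extractfile(member_name)  # KeyError if missing
        if extracted is None:
            raise FileNotFoundError(f"{member_name} is not a regular file")
        return extracted.read()


if __name__ == "__main__":
    orcid_id = "0000-0002-7869-831X"
    print(summary_member(orcid_id))
    print(activities_archive(orcid_id))
    print(activity_member(orcid_id, "employments", "7923980"))
```

For the example iD, the three helpers reproduce the paths quoted in the description. Because a gzip-compressed tar has to be read sequentially, `read_member` may scan a large part of the archive before it reaches the requested file. For repeated lookups, extracting the archive once is likely the more practical choice.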
Created:
2020-10-13



