five

OpenCitations Meta RDF dataset of all bibliographic metadata and its provenance information

收藏
DataCite Commons2025-04-01 更新2024-08-18 收录
下载链接:
https://figshare.com/articles/dataset/OpenCitations_Meta_RDF_dataset_of_all_bibliographic_metadata_and_its_provenance_information/21747536/5
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains all the bibliographic metadata and its provernance information (in JSON-LD format) included in OpenCitations Meta.The data and the provenance are organized through a complex structure of folders and subfolders, that allows you to quickly find any entity from its URI. The first level consists of the following folders, that are provided zipped and separately:<b>[folder "ar"]</b>: contains the data and provenance of the responsible agent type entities (http://purl.org/spar/pro/RoleInTime);<b>[folder "br"]</b>: contains the data and provenance of the entities of type bibliographic resource (http:///purl.org/spar/fabio/Expression);<b>[folder "id"]</b>: contains the data and provenance of the identifier entities (http://purl.org/spar/datacite/Identifier);<b>[folder "ra"]</b>: contains the data and provenance of the responsible agent type entities (http://xmlns.com/foaf/0.1/Agent);<b>[folder "ra"]</b>: contains the data and provenance of resource embodiment entities (http://purl.org/spar/fabio/Manifestation).The inner folders are named through the <b>supplier prefix</b> of the contained entities. It is a prefix that allows you to recognize the entity membership index (e.g., OpenCitations Meta corresponds to <b>06*0</b>).After that, the folders have <b>numeric names</b>, which refer to the range of contained entities. For example, the 10000 folder contains entities from 1 to 10000. Inside, you can find the <b>zipped </b>RDF data.At the same level, additional folders containing the <b>provenance </b>are named with the same criteria already seen. Then, the 1000 folder includes the provenance of the entities from 1 to 1000. The provenance is located inside a folder called <b>prov</b>, also in zipped JSON-LD format.For example, data related to the entity is located in the folder /br/06250/10000/1000/1000.zip, while information about provenance in /br/06250/10000/1000/prov/1000.zipThis version of the dataset contains:105,953,699 bibliographic entities338,173,282 authors and 2,523,200 editors (counted by their roles, without disambiguating individuals)691,262 publication venues36,679 publishersThe weight of the sum of the archives is 38.1 GB on an NTFS filesystem, which does not vary once extracted because they contain zipped JSON files. We recommend processing such files as zipped without extracting them.Additional information about OpenCitations Meta at the official webpage.<br>
提供机构:
figshare
创建时间:
2023-10-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作