Metadata for Datasets and Relationships
收藏DataCite Commons2024-08-23 更新2024-07-13 收录
下载链接:
https://figshare.com/articles/dataset/Metadata_for_Datasets_and_Relationships/22790810/5
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains two tables. One table contains metadata for "citable" datasets (datasets that have either a DOI or a compact identifier). The other table contains the relationships between each pair of datasets in the first table.We generated this corpus of dataset metadata by crawling the Web to find pages with schema.org or DCAT metadata indicating that the page contains a dataset. The metadata for datasets includes information such as the dataset’s name, description, provider, creation date, Digital Object Identifiers (DOI), and more. Out of the 46 million dataset pages that have schema.org, we publish this subset of 4.3 million dataset-metadata entries that are citable. We also include an additional table on relationships between these datasets.
本数据集包含两张数据表。其中一张数据表存储可引用数据集(即带有数字对象标识符(Digital Object Identifiers, DOI)或紧凑标识符的数据集)的元数据;另一张数据表则收录第一张数据表中各数据集的两两关联关系。本数据集的元数据语料库通过网络爬取构建:我们抓取了所有带有schema.org或DCAT元数据且标注页面包含数据集的网页。数据集元数据涵盖其名称、描述、提供方、创建日期、数字对象标识符(Digital Object Identifiers, DOI)等信息。在总计4600万个带有schema.org元数据的数据集网页中,我们选取其中可引用的430万条数据集元数据条目作为本数据集的内容发布。此外,本数据集还附带一张收录上述数据集间关联关系的额外数据表。
提供机构:
figshare
创建时间:
2024-07-12



