Analysis of research data for 11 Institutions - Data Monitor
Source: doi.org. Indexed: 2025-01-15
Download link:
http://doi.org/10.17632/k5p45z33kb.3
Resource description:
We conducted an analysis to confirm our observation that only a very small percentage of public research data is hosted in institutional data repositories, while the vast majority is published in open domain-specific and generalist data repositories.
For this analysis, we selected 11 institutions, many of which have been our evaluation partners. For each institution, we counted the number of datasets published in its Institutional Data Repository (IDR) and tracked the number of public research datasets hosted in external data repositories via the Data Monitor API. External tracking was based on a corpus of more than 14 million data records matched against the institution's SciVal ID. One institution did not have an IDR.
We found that 10 out of 11 institutions had most of their public research data hosted outside the institution. Here, research data means not only datasets but a broader notion that includes, for example, software.
We will be happy to expand this analysis by adding more institutions upon request.
Note: This is version 2 of the earlier published dataset. The number of datasets published and tracked in the Monash Institutional Data Repository has been updated based on information provided by the Monash Library. The count for the NTU Institutional Data Repository now includes datasets only; Dataverses were excluded to avoid double counting.
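The comparison described above (datasets in the IDR versus public research data tracked in external repositories) can be illustrated with a minimal Python sketch. It is not the authors' method and does not call the Data Monitor API. The class name, the function names, and all figures are hypothetical placeholders rather than values from the dataset, and the sketch assumes the two counts do not overlap.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class InstitutionCounts:
    """Hypothetical per-institution record; field names are assumptions."""
    name: str
    idr_datasets: int      # datasets counted in the Institutional Data Repository
    external_records: int  # public research data tracked in external repositories

    def external_share(self) -> float:
        # Fraction of all counted public research data hosted externally.
        # Assumes IDR and external counts do not overlap.
        total = self.idr_datasets + self.external_records
        return self.external_records / total if total else 0.0


def mostly_external(rows: List[InstitutionCounts]) -> List[str]:
    # Institutions with more than half of their data hosted outside the IDR.
    return [r.name for r in rows if r.external_share() > 0.5]


# Placeholder figures for illustration only; NOT taken from the dataset.
sample = [
    InstitutionCounts("Institution A", idr_datasets=120, external_records=2880),
    InstitutionCounts("Institution B", idr_datasets=900, external_records=300),
    InstitutionCounts("Institution C", idr_datasets=0, external_records=1500),  # no IDR
]

for row in sample:
    print(f"{row.name}: {row.external_share():.0%} hosted externally")
print(mostly_external(sample))
```

An institution without an IDR is represented here with an IDR count of zero, so its external share is 100% whenever any external records are tracked.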
Provider:
doi.org



