PLOS Open Science Indicators
Source: plos.figshare.com. Updated 2024-09-30; indexed 2025-03-22.
Download link:
https://plos.figshare.com/articles/dataset/PLOS_Open_Science_Indicators/21687686/7
Description:
This dataset contains article metadata and information about Open Science Indicators for approximately 112,000 research articles published in PLOS from 1 January 2018 to 31 March 2024 and a set of approximately 23,000 comparator articles published in non-PLOS journals. This is the seventh release of this dataset, which will be updated with new versions as newly published content is analysed.
This version of the Open Science Indicators dataset comprises three components. The first, also included in earlier versions of the dataset, focuses on detection of three Open Science practices by analysing the XML of published research articles: (1) sharing of research data, in particular data shared in data repositories; (2) sharing of code; (3) posting of preprints.
The dataset provides data and code generation and sharing rates, and the location of shared data and code (whether in Supporting Information or in an online repository). It also provides preprint sharing rates as well as details of the shared preprint, such as publication date, URL and preprint server used. Additional data fields are also provided for each article analysed, such as geographic information (‘Country’) and research topics (‘Discipline’).
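As an illustration of how a sharing rate might be derived from per-article rows, the sketch below groups rows by one field and computes the fraction flagged as shared. Only ‘Country’ is a field name taken from the description above. The indicator column `Data_Shared`, its `Yes`/`No` values and the toy rows are hypothetical placeholders, so check OSI-Column-Descriptions_v3_Dec23.pdf for the real field names and values before using this.

```python
from collections import defaultdict


def sharing_rate_by_group(rows, group_col, flag_col, positive="Yes"):
    """Return {group: fraction of rows whose flag_col equals `positive`}."""
    totals = defaultdict(int)
    shared = defaultdict(int)
    for row in rows:
        group = row[group_col]
        totals[group] += 1
        if row[flag_col] == positive:
            shared[group] += 1
    return {group: shared[group] / totals[group] for group in totals}


# Toy rows for illustration only; these are not values from the dataset.
# "Data_Shared" is a hypothetical column name.
sample_rows = [
    {"Country": "A", "Data_Shared": "Yes"},
    {"Country": "A", "Data_Shared": "No"},
    {"Country": "B", "Data_Shared": "Yes"},
]

rates = sharing_rate_by_group(sample_rows, "Country", "Data_Shared")
print(rates)
```

The same function could be applied with ‘Discipline’ as the grouping field, assuming the rows carry that column.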
The second component, first shared in version 4, contains a fourth Open Science Indicator: detection of protocol sharing. This is presented as a preliminary version of the data. The protocols dataset contains information on whether protocol sharing was detected in the article and the sources of those protocols (i.e. where the protocol was shared).
The third component, new to this version of the dataset, contains the fifth Open Science Indicator: detection of Study Registration sharing. The Study Registration dataset contains information on whether the article reports a study registration, the registry it was shared in and where in the article the mention of the registration was detected.
Further information on the methods used to collect and analyse the data can be found in the Main Documentation folder for the main OSI dataset, the Preliminary Release for Protocols Indicator folder for protocols, or the Preliminary Release for Study Registration Indicator folder for study registrations.
Further information on the principles and requirements for developing Open Science Indicators is available in https://doi.org/10.6084/m9.figshare.21640889.
Data folders/files
Main Data Files folder
This folder contains the main OSI dataset files PLOS-Dataset_v7_Jun24.csv and Comparator-Dataset_v7_Jun24.csv, which contain: (1) descriptive metadata, e.g. article title, publication date and author countries, taken from the article .xml files; (2) additional information about the Open Science Indicators, derived algorithmically using Natural Language Processing.
The OSI-Summary-statistics_v7_Jun24.xlsx file contains the summary data for both PLOS-Dataset_v7_Jun24.csv and Comparator-Dataset_v7_Jun24.csv.
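A minimal sketch of how the two main CSV files might be read and compared, using only the Python standard library. The column names `DOI` and `Code_Shared`, the `Yes`/`No` values and the inline CSV text are hypothetical stand-ins so that the example runs without the real files. The actual headers are documented in OSI-Column-Descriptions_v3_Dec23.pdf.

```python
import csv
import io


def load_rows(fileobj):
    """Read a CSV file-like object into a list of dicts keyed by header."""
    return list(csv.DictReader(fileobj))


def overall_rate(rows, flag_col, positive="Yes"):
    """Fraction of rows whose flag_col equals `positive` (0.0 if no rows)."""
    if not rows:
        return 0.0
    return sum(1 for row in rows if row[flag_col] == positive) / len(rows)


# Inline stand-ins for the real files; the values are made up.
# With the real data, something like
#   open("PLOS-Dataset_v7_Jun24.csv", newline="", encoding="utf-8")
# could be passed to load_rows instead (the encoding is an assumption).
plos_csv = io.StringIO(
    "DOI,Code_Shared\n10.0000/a,Yes\n10.0000/b,No\n10.0000/c,Yes\n10.0000/d,No\n"
)
comparator_csv = io.StringIO(
    "DOI,Code_Shared\n10.0000/w,No\n10.0000/x,Yes\n10.0000/y,No\n10.0000/z,No\n"
)

plos_rate = overall_rate(load_rows(plos_csv), "Code_Shared")
comparator_rate = overall_rate(load_rows(comparator_csv), "Code_Shared")
print(plos_rate, comparator_rate)
```

Reading through a file-like object keeps the functions usable with both real files and in-memory test data.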
Main Documentation folder
This folder contains documentation related to the main data files. The file OSI-Methods-Statement_v7_Jun24.pdf describes the methods underlying the data collection and analysis. OSI-Column-Descriptions_v3_Dec23.pdf describes the fields used in PLOS-Dataset_v7_Jun24.csv and Comparator-Dataset_v7_Jun24.csv. OSI-Repository-List_v1_Dec22.xlsx lists the repositories, with their characteristics, that are used to identify specific repositories in the repository fields of PLOS-Dataset_v7_Jun24.csv and Comparator-Dataset_v7_Jun24.csv.
Preliminary Release for Protocols Indicator folder
This folder contains files related to the new Indicator on protocol sharing. The file Protocols-Dataset_Sep23.csv contains data on protocol sharing for the PLOS and Comparator article corpora. The methods for developing this indicator are described in Protocols-Methods-Statement_Sep23.pdf. The Protocols-Column-Headings_Sep23.pdf file describes the column headings used in Protocols-Dataset_Sep23.csv. A summary of the protocols dataset is given in Protocols-Summary-Statistics_Sep23.xlsx, which is used within the related blog post https://theplosblog.plos.org/2023/10/measuring-protocol-sharing.
Preliminary Release for Study Registration Indicator folder
This folder contains the files related to the fifth indicator on study registration. The file Study-Registration_Dataset_Jun24.csv contains the data on study registrations for both PLOS and Comparator articles. The methods for developing this indicator are described in Study-Registration-Methods-Statement_Jun24.pdf and the fields used in the dataset are described in Registration-Column-Headings_Jun24.pdf. A summary of the study registration results is given in Study-Registration-Summary-Statistics_Jun24.xlsx.
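Because the protocols and study registration indicators are released as separate files, combining them with the main dataset requires matching rows on an article identifier. The sketch below shows one way to do a left join in plain Python. It assumes that both files share a key column, here called `DOI`. That key name, the other column names and the toy rows are all hypothetical, so consult the column-heading PDFs for the real identifiers.

```python
def left_join(main_rows, extra_rows, key):
    """Attach columns from extra_rows to main_rows where `key` matches.

    Rows in main_rows without a match are kept unchanged.
    """
    extra_by_key = {row[key]: row for row in extra_rows}
    joined = []
    for row in main_rows:
        merged = dict(row)  # copy so the input rows are not modified
        match = extra_by_key.get(row[key], {})
        for col, value in match.items():
            if col != key:
                merged[col] = value
        joined.append(merged)
    return joined


# Toy rows for illustration only; column names and values are hypothetical.
main_rows = [
    {"DOI": "10.0000/a", "Data_Shared": "Yes"},
    {"DOI": "10.0000/b", "Data_Shared": "No"},
]
registration_rows = [
    {"DOI": "10.0000/a", "Registry": "ExampleRegistry"},
]

joined = left_join(main_rows, registration_rows, "DOI")
print(joined)
```

A left join keeps every article from the main file, so articles with no detected registration remain in the result without the extra columns.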
Contact details for further information:
Iain Hrynaszkiewicz, Director, Open Research Solutions, PLOS, ihrynaszkiewicz@plos.org / plos@plos.org
Lauren Cadwallader, Open Research Manager, PLOS, lcadwallader@plos.org / plos@plos.org
Acknowledgements:
Thanks to Allegra Pearce, Tim Vines, Asura Enkhbayar and Scott Kerr of DataSeer for contributing to data acquisition and supporting information.
Provider:
Public Library of Science



