PLOS Open Science Indicators
收藏DataCite Commons2024-12-18 更新2024-11-06 收录
下载链接:
https://plos.figshare.com/articles/dataset/PLOS_Open_Science_Indicators/21687686/8
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains article metadata and information about Open Science Indicators for approximately 118,000 research articles published in PLOS from 1 January 2018 to 30 June 2024 and a set of approximately 24,000 comparator articles published in non-PLOS journals. This is the eighth release of this dataset, which will be updated with new versions as newly published content is analysed.
This version of the Open Science Indicators dataset comprises of 3 components. The first, also included in earlier versions of the dataset, focuses on detection of three Open Science practices by analysing the XML of published research articles:
Sharing of research data, in particular data shared in data repositories
Sharing of code
Posting of preprints
The dataset provides data and code generation and sharing rates, the location of shared data and code (whether in Supporting Information or in an online repository). It also provides preprint sharing rates as well as details of the shared preprint, such as publication date, URL and preprint server used. Additional data fields are also provided for each article analysed, such as geographic information (‘Country’) and research topics (‘Discipline’).
The second component, first shared in version 4, contains a fourth Open Science Indicator - detection of protocol sharing. This is presented as a preliminary version of the data. The protocols dataset contains information on whether protocols sharing from the article has been detected and the sources of those protocols (i.e. where the protocol was shared).
The third component, first shared in version 7, contains the fifth Open Science Indicator - detection of Study Registration sharing. The Study Registration dataset contains information on whether the article reports a study registration, the registry it was shared in and where in the article the mention of the registration was detected.
Further information on the methods used to collect and analyse the data can be found in Main Documentation folder for the main OSI dataset, the Preliminary Release for Protocols Indicator folder for protocols or the Preliminary Release for Study Registration Indicator folder for study registrations.
Further information on the principles and requirements for developing Open Science Indicators is available in https://doi.org/10.6084/m9.figshare.21640889.
<br>
<strong>Data folders/files</strong>
Main Data Files folder
This folder contains the main OSI dataset files PLOS-Dataset_v8_Sep24.csv and Comparator-Dataset_v8_Sep24.csv, which contain
descriptive metadata, e.g. article title, publication data, author countries, is taken from the article .xml files
additional information around the Open Science Indicators derived algorithmically, using Natural Language Processing
and the OSI-Summary-statistics_v8_Sep24.xlsx file contains the summary data for both PLOS-Dataset_v8_Sep24.csv and Comparator-Dataset_v8_Sep24.csv.
Main Documentation folder
This file contains documentation related to the main data files. The file OSI-Methods-Statement_v8_Sep24.pdf describes the methods underlying the data collection and analysis. OSI-Column-Descriptions_v3_Dec23.pdf describes the fields used in PLOS-Dataset_v8_Sep24.csv and Comparator-Dataset_v8_Sep24.csv. OSI-Repository-List_v1_Dec22.xlsx lists the repositories and their characteristics used to identify specific repositories in the PLOS-Dataset_v8_Sep24.csv and Comparator-Dataset_v8_Sep24.csv repository fields.
Preliminary Release for Protocols Indicator folder
This folder contains files related to the new Indicator on protocol sharing. The file Protocols-Dataset_Sep23.csv contains data on protocol sharing pertaining to the PLOS and Comparator corpus of articles. The methods for developing this indicator are described in Protocols-Methods-Statement_Sep23.pdf. The Protocols-Column-Headings_Sep23.pdf file described the column headings used in Protocols-Dataset_Sep23.csv. A summary of the protocols dataset is given in Protocols-Summary-Statistics_Sep23.xlsx, which is used within the related blog post https://theplosblog.plos.org/2023/10/measuring-protocol-sharing.
<br>
Preliminary Release for Study Registration Indicator folder
This folder contains the files related to the fifth indicator on study registration. The file Study-Registration_Dataset_Jun24.csv contains the data on study registrations for both PLOS and Comparator articles. The methods for developing this indicator are described in Study-Registration-Methods-Statement_Jun24.pdf and the fields used in the dataset are described in Registration-Column-Headings_Jun24.pdf. A summary of the results of study registration are given in Study-Registration-Summary-Statistics_Jun24.xlsx.
<br>
<strong>Contact details for further information:</strong>
Iain Hrynaszkiewicz, Director, Open Research Solutions, PLOS, ihrynaszkiewicz@plos.org / plos@plos.org
Lauren Cadwallader, Open Research Manager, PLOS, lcadwallader@plos.org / plos@plos.org
<br>
<strong>Acknowledgements:</strong>
Thanks to Allegra Pearce, Tim Vines, Asura Enkhbayar and Scott Kerr of DataSeer for contributing to data acquisition and supporting information.
本数据集涵盖2018年1月1日至2024年6月30日期间发表于公共科学图书馆(PLOS)的约11.8万篇研究论文的元数据,以及开放科学指标(Open Science Indicators)相关信息,同时包含约2.4万篇发表于非PLOS期刊的对照论文的相关信息。本数据集为第8次发布版本,后续将针对新发表的内容开展分析并更新至新版本。
本版开放科学指标数据集包含3个组成部分。第一个组成部分(此前版本的数据集亦包含该部分)聚焦于通过分析已发表研究论文的XML文件,识别三类开放科学实践:研究数据共享(尤其是存储于数据仓储中的数据共享)、代码共享及预印本发布。
本数据集提供了数据与代码的生成及共享率,以及共享数据与代码的存储位置(是否附于支持信息或在线仓储中);同时提供了预印本共享率,以及已共享预印本的详细信息,如发布日期、URL及所用的预印本平台。此外,还为每篇经分析的论文提供了额外的数据字段,例如地理信息(「国家/地区」)与研究主题(「学科」)。
第二个组成部分首次发布于第4版,包含第四项开放科学指标——研究方案共享识别。该部分数据以预览版形式提供。研究方案数据集涵盖了是否从论文中检测到研究方案共享,以及这些方案的共享来源(即方案的共享位置)等信息。
第三个组成部分首次发布于第7版,包含第五项开放科学指标——研究注册共享识别。研究注册数据集涵盖了论文是否报告了研究注册、注册所用的仓储平台,以及论文中提及注册的位置等信息。
有关本数据集收集与分析方法的更多信息,可查阅主开放科学指标(OSI)数据集的「主文档文件夹」、协议指标预览版文件夹(针对协议相关数据),或研究注册指标预览版文件夹(针对研究注册相关数据)。
有关开发开放科学指标的原则与要求的更多信息,可访问https://doi.org/10.6084/m9.figshare.21640889获取。
<br>
<strong>数据文件夹/文件</strong>
「主数据文件文件夹」
本文件夹包含主开放科学指标数据集文件PLOS-Dataset_v8_Sep24.csv与Comparator-Dataset_v8_Sep24.csv,二者收录了从论文XML文件中提取的描述性元数据(如论文标题、发表信息、作者所属国家),以及通过自然语言处理(Natural Language Processing)算法推导得到的开放科学指标相关附加信息。OSI-Summary-statistics_v8_Sep24.xlsx文件则包含上述两个CSV文件的汇总统计数据。
「主文档文件夹」
本文件夹包含与主数据文件相关的文档:OSI-Methods-Statement_v8_Sep24.pdf阐述了数据收集与分析的底层方法;OSI-Column-Descriptions_v3_Dec23.pdf说明了PLOS-Dataset_v8_Sep24.csv与Comparator-Dataset_v8_Sep24.csv中所用的字段;OSI-Repository-List_v1_Dec22.xlsx列出了用于识别上述两个CSV文件中仓储字段的各类仓储及其特征。
「协议指标预览版文件夹」
本文件夹包含与研究方案共享新指标相关的文件:Protocols-Dataset_Sep23.csv收录了PLOS与对照论文集的协议共享数据;Protocols-Methods-Statement_Sep23.pdf阐述了该指标的开发方法;Protocols-Column-Headings_Sep23.pdf说明了Protocols-Dataset_Sep23.csv所用的列标题;Protocols-Summary-Statistics_Sep23.xlsx提供了该数据集的汇总信息,相关博客文章https://theplosblog.plos.org/2023/10/measuring-protocol-sharing中也用到了该文件。
<br>
「研究注册指标预览版文件夹」
本文件夹包含与研究注册第五项指标相关的文件:Study-Registration_Dataset_Jun24.csv收录了PLOS与对照论文的研究注册相关数据;Study-Registration-Methods-Statement_Jun24.pdf阐述了该指标的开发方法;Registration-Column-Headings_Jun24.pdf说明了数据集中所用的字段;Study-Registration-Summary-Statistics_Jun24.xlsx提供了研究注册结果的汇总信息。
<br>
<strong>进一步咨询联系方式:</strong>
伊恩·赫里纳斯基维奇(Iain Hrynaszkiewicz),PLOS开放研究解决方案总监,邮箱:ihrynaszkiewicz@plos.org / plos@plos.org
劳伦·卡德瓦拉德(Lauren Cadwallader),PLOS开放研究经理,邮箱:lcadwallader@plos.org / plos@plos.org
<br>
<strong>致谢:</strong>
感谢DataSeer的阿莱格拉·皮尔斯(Allegra Pearce)、蒂姆·瓦因斯(Tim Vines)、阿苏拉·恩赫巴亚尔(Asura Enkhbayar)与斯科特·凯尔(Scott Kerr)为数据采集与信息支持提供的贡献。
提供机构:
Public Library of Science
创建时间:
2024-09-30



