five

Leveraging metrics to drive data sharing at the Science journals

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
http://datadryad.org/dataset/doi%253A10.5061%252Fdryad.zkh1893qt
下载链接
链接失效反馈
官方服务:
资源简介:
For the scientific research published in Science to be accessible, it is important that the data, methods, results, and code are transparently reported and openly shared. Science has policies on material, data, and code sharing that support our goals of transparency and openness. These include that all papers must have a data availability statement and that all data and code must be available in the paper or deposited in a permanent public repository. Exceptions are made—for example, if there are security concerns or to protect personal privacy. In 2024, Science partnered with the company DataSeer to determine the extent to which our research papers share data and code. Dataseer uses natural language Processing (NLP) to measure a number of Open Science Indicators in published articles. The dataset “AAAS Open Science Metrics data 2021 to 2024” provides article-level data for 2680 Science papers published between 2021 and 2024. This includes article metadata, such as the doi and publication date, as well as the first listed country for the first author (obtained from OpenAlex), data on whether data and code were generated and whether and how they were shared, and data on whether the paper was preprinted (based on fuzzy matching of the article title and authors against articles from major preprint servers). We used this dataset to calculate aggregate data for data and code sharing, as well as whether data was shared in a repository. All papers had a data availability statement; 69% of papers shared data in a repository, online, or in a supplementary data table (6% of papers did not generate or share data); and 23% of papers shared code (46% of papers did not generate or share code). We compared the Science aggregate data with publicly available data from the Public Library of Science (PLOS) and the academic publisher Taylor and Francis. Summary data are provided in the file “Summary data Science and comparators.” This file gives the total number of publications for each source, the number sharing data overall (in a repository, online, or in the supplementary material), the number sharing data in a repository, and the number generating and sharing code. Overall data sharing was 69% for Science, 74% for PLOS, and 24% for Taylor and Francis, whereas data shared in a repository was 56% for Science, 26% for PLOS, and 11% for Taylor and Francis. For the papers that generated code, code sharing was at 41% for Science, 29% for PLOS, and 8% for Taylor and Francis. This provides a baseline as we implement policies and processes to further improve data and code sharing.
创建时间:
2025-12-24
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作