five

Data for Puentes-Mestril et al., Evaluation of scientific impact and knowledge translation facilitated by controlled sharing of human genomic data under the NIH Genomic Data Sharing Policy

收藏
Figshare2025-09-08 更新2026-04-08 收录
下载链接:
https://figshare.com/articles/dataset/Data_for_Puentes-Mestril_et_al_Evaluation_of_scientific_impact_and_knowledge_translation_facilitated_by_controlled_sharing_of_human_genomic_data_under_the_NIH_Genomic_Data_Sharing_Policy/30032332/1
下载链接
链接失效反馈
官方服务:
资源简介:
Sharing data can increase scientific transparency and reproducibility, allow for novel uses, and spur discovery and innovation beyond the research enterprise. To maximize these benefits, the US National Institutes of Health (NIH) has implemented several data sharing efforts, including the 2014 Genomic Data Sharing (GDS) Policy, which promotes the broad and responsible sharing of genomic data. The Policy expects large-scale human genomic data resulting from NIH funding to be shared through controlled-access data repositories, including the NIH the database of Genotypes and Phenotypes (dbGaP). To understand how sharing genomic data through controlled access repositories contributes to secondary research and facilitates the translation of research to broader societal benefits, we assessed the impact of research that reused data accessed through dbGaP’s Authorized Access System. This dataset contains the results of our analysis.For our analysis, a list of “Manuscripts Citing dbGaP Authorized Access System Data” was downloaded from https://www.ncbi.nlm.nih.gov/gap/summary/pub/ on April 26, 2023 and curated to ensure accuracy and compatibility with our analysis tools. Publications were categorized as a “deposit” if authored by a data submitter, “secondary use” if acknowledging a phs accession number as a source of data, or “reference” if referencing a phs accession number as a resource but not a specific source of data. Publications meeting the criteria of both a deposit and secondary use publication were manually categorized as secondary use. Publications not yet categorized were labeled as “undetermined.” To ensure compatibility with our analysis tools, the PubMed Central IDs (PMCIDs) of all remaining publications on the list were converted to their corresponding PubMed ID (PMID) using the NIH conversion tool (https://www.ncbi.nlm.nih.gov/pmc/tools/idconv/). Publications lacking a corresponding PMID were removed from further analysis.To determine the scientific influence of secondary use publications, we used the NIH Office of Portfolio Analysis’ (OPA) <i>iCite</i> tool (https://icite.od.nih.gov/) to calculate their Relative Citation Ratios (RCRs). For additional bibliometric analysis, we used<i> iSearch</i>, a portfolio analysis platform internal to NIH that contains linked datasets of publications, clinical trials, global grants, and patents.For additional details, please refer to README.docx. For the methodology used to generate each file, please refer to the README tab of the corresponding CSV file.

数据共享可提升科研活动的透明度与可重复性,支持创新应用,并推动科研体系之外的发现与创新。为最大化此类效益,美国国立卫生研究院(National Institutes of Health, NIH)推出了多项数据共享举措,其中包括2014年基因组数据共享(Genomic Data Sharing, GDS)政策,该政策旨在推动基因组数据的广泛且负责任的共享。该政策要求由NIH资助产生的大规模人类基因组数据,需通过受控访问数据存储库进行共享,其中就包括NIH旗下的基因型与表型数据库(database of Genotypes and Phenotypes, dbGaP)。 为探究通过受控访问存储库共享基因组数据如何助力二次研究,并推动研究成果转化为更广泛的社会效益,本研究评估了复用dbGaP授权访问系统获取的数据的相关研究的影响。本数据集包含本次分析的全部结果。 在本次分析中,我们于2023年4月26日从https://www.ncbi.nlm.nih.gov/gap/summary/pub/ 下载了“引用dbGaP授权访问系统数据的手稿”列表,并对其进行整理以确保准确性及与分析工具的兼容性。我们将出版物分为三类:若为数据提交者所撰写,则归类为“存档(deposit)”;若标注了phs登录号作为数据来源,则归类为“二次使用(secondary use)”;若仅将phs登录号作为资源而非特定数据来源进行引用,则归类为“参考文献(reference)”。同时满足存档与二次使用标准的出版物,经人工归类为二次使用。未完成分类的出版物则标记为“未确定(undetermined)”。为确保与分析工具兼容,我们使用NIH的ID转换工具(https://www.ncbi.nlm.nih.gov/pmc/tools/idconv/)将列表中所有剩余出版物的PubMed Central ID(PMC ID)转换为对应的PubMed ID(PMID),无对应PMID的出版物将被移除,不再纳入后续分析。 为评估二次使用出版物的科学影响力,我们使用NIH项目分析办公室(Office of Portfolio Analysis, OPA)的<i>iCite</i>工具(https://icite.od.nih.gov/)计算其相对引用率(Relative Citation Ratios, RCRs)。为开展额外的文献计量分析,我们使用了NIH内部的项目分析平台<i>iSearch</i>,该平台包含出版物、临床试验、全球资助项目及专利的关联数据集。 更多详细信息请参阅README.docx。各文件的生成方法,请参阅对应CSV文件的README标签页。
提供机构:
Petroff, Rebekah L.; Leeds, Hilary; Jacobs, Cheryl; Winchester, Danyelle; Kasperbauer, Tyler; Wann, Ellen; Puentes, Carlos; Hafez, Amy; Paine, Taunton
创建时间:
2025-09-08
二维码
社区交流群
二维码
科研交流群
商业服务