Early Indicator for Data Sharing and Reuse - Supplementary Tables.xlsx
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://figshare.com/articles/dataset/Early_Indicator_for_Data_Sharing_and_Reuse_-_Supplementary_Tables_xlsx/22720399
下载链接
链接失效反馈官方服务:
资源简介:
These data were generated for an investigation of research data repository (RDR) mentions in biuomedical research articles.
Supplementary Table 1 is a discrete subset of SciCrunch RDRs used to study RDR mentions in biomedical literature. We generated this list by starting with the top 1000 entries in the SciCrunch database, measured by citations, removed entries for organizations (such as universities without a corresponding RDR) or non-relevant tools (such as reference managers), updated links, and consolidated duplicates resulting from RDR mergers and name variations. The resulting list of 737 RDRs is shown in with as a base based on a source list of RDRs in the SciCrunch database. The file includes the Research Resource Identifier (RRID), the RDR name, and a link to the RDR record in the SciCrunch database.
Supplementary Table 2 shows the RDRs, associated journals, and article-mention pairs (records) with text snippets extracted from mined Methods text in 2020 PubMed articles. The dataset has 4 components. The first shows the list of repositories with RDR mentions, and includes the Research Resource Identifier (RRID), the RDR name, the number of articles that mention the RDR, and a link to the record in the SciCrunch database. The second shows the list of journals in the study set with at least 1 RDR mention, andincludes the Journal ID, nam, ESSN/ISSN, the total count of publications in 2020, the number of articles that had text available to mine, the number of article-mention pairs (records), number of articles with RDR mentions, the number of unique RDRs mentioned, % of articles with minable text. The third shows the top 200 journals by RDR mention, normalized by the proportion of articles with available text to mine, with the same metadata as the second table. The fourth shows text snippets for each RDR mention, and includes the RRID, RDR name, PubMedID (PMID), DOI, article publication date, journal name, journal ID, ESSN/ISSN, article title, and snippet.
本数据集为探究生物医学研究论文中提及研究数据仓储(Research Data Repository, RDR)的相关研究而生成。
补充表1为用于研究生物医学文献中RDR提及情况的SciCrunch数据库RDR离散子集。我们以SciCrunch数据库中被引次数排名前1000的条目为基础生成该列表,随后剔除了机构(如无对应RDR的大学)或非相关工具(如参考文献管理软件)的条目,更新了链接,并整合了因RDR合并与名称变体产生的重复条目。本次最终得到的737个RDR列表以SciCrunch数据库中的RDR源列表为基准构建。该文件包含研究资源标识符(Research Resource Identifier, RRID)、RDR名称以及指向SciCrunch数据库中RDR记录的链接。
补充表2展示了RDR、关联期刊以及从2020年PubMed论文的方法学文本中提取得到的文本片段对应的论文提及对(记录)。本数据集包含四个组成部分:第一部分为包含RDR提及情况的仓储列表,包含研究资源标识符(RRID)、RDR名称、提及该RDR的论文数量以及指向SciCrunch数据库中对应记录的链接;第二部分为研究集合中至少被提及1次RDR的期刊列表,包含期刊ID、期刊名称、电子国际标准连续出版物号/国际标准连续出版物号(ESSN/ISSN)、2020年总出版物数量、可挖掘文本的论文数量、论文提及对(记录)数量、存在RDR提及的论文数量、被提及的唯一RDR数量以及可挖掘文本的论文占比;第三部分为按RDR提及量排序的前200种期刊,其排序已根据可挖掘文本的论文占比进行归一化处理,元数据与第二部分表格一致;第四部分为每条RDR提及对应的文本片段,包含RRID、RDR名称、PubMed标识符(PubMedID, PMID)、数字对象标识符(Digital Object Identifier, DOI)、论文发表日期、期刊名称、期刊ID、ESSN/ISSN、论文标题以及文本片段。
创建时间:
2023-04-28



