five

SciDisruptor-PubMed

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://data.mendeley.com/datasets/hyg4vfgjtn
下载链接
链接失效反馈
官方服务:
资源简介:
Research hypothesis and data interpretation This project is based on the hypothesis that patterns of disruption in scientific publications can be identified in a fully automated and reproducible way through citation networks and the Disruption Index. To test this, a series of Python scripts was developed to extract, organize, and analyze data directly from the PubMed database. What the code demonstrates The scripts automate the entire process: (1) querying articles by keyword using the PubMed E-utilities API; (2) extracting complete article data in XML format; (3) cleaning and organizing the extracted data; (4) building the citation network among the selected articles; and (5) calculating the Disruption Index using the formula proposed by Wu, Wang, and Evans, via the PySciSci package. How to interpret and use the code Each script is modular and designed to run sequentially, with comments and instructions for execution in a Jupyter Notebook environment. Input data is retrieved directly from PubMed, and the outputs include: PMIDs of all articles, structured citation networks, tables with the count of citation types (A, B, C), and the final Disruption Index score. The code can be used by other researchers to replicate this study or to apply the pipeline to other scientific domains.
创建时间:
2025-07-04
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作