five

NDC-substances

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/10155830
下载链接
链接失效反馈
官方服务:
资源简介:
Overview This is a temporal higher-order network dataset, which here means a sequence of timestamped simplices where each simplex is a set of nodes. Under the Drug Listing Act of 1972, the U.S. Food and Drug Administration releases information on all commercial drugs going through the regulation of the agency, forming the National Drug Code (NDC) Directory. In this dataset, each hyperedge corresponds to an NDC code for a drug, and the nodes are substances that make up the drug. Timestamps are in days and represent when the drug was first marketed. We restricted to hyperedges containing at most 25 nodes. Statistics Number of nodes: 5,311 Number of timestamped hyperedges: 112,405 Number of unique hyperedges: 10,025 Source of original data Source: NDC-substances. References If you use this data, please cite the following paper:  Simplicial closure and higher-order link prediction. Austin R. Benson, Rediet Abebe, Michael T. Schaub, Ali Jadbabaie, and Jon Kleinberg. Proceedings of the National Academy of Sciences (PNAS), 2018.

数据集概述 本数据集为时序高阶网络数据集,此处指由带时间戳的单形(simplices)构成的序列,其中每个单形均为一组节点的集合。根据1972年《药品名录法案》(Drug Listing Act of 1972),美国食品药品监督管理局(U.S. Food and Drug Administration,FDA)会公开所有经过其监管的商用药品信息,形成国家药品代码(National Drug Code,NDC)名录。本数据集中,每条超边(hyperedge)对应一款药品的国家药品代码,节点则为构成该药品的物质成分。时间戳以天为单位,代表该药品首次上市的日期。本数据集仅保留节点数不超过25的超边。 统计信息 节点总数:5,311 带时间戳的超边总数:112,405 唯一超边总数:10,025 原始数据来源 来源:NDC-substances 参考文献 若使用本数据集,请引用以下论文: 《单形闭包与高阶链路预测》(Simplicial closure and higher-order link prediction),作者:Austin R. Benson、Rediet Abebe、Michael T. Schaub、Ali Jadbabaie及Jon Kleinberg,发表于《美国国家科学院院刊》(Proceedings of the National Academy of Sciences,PNAS),2018年。
创建时间:
2024-04-04
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作