NDC-substances
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/10155830
下载链接
链接失效反馈官方服务:
资源简介:
Overview
This is a temporal higher-order network dataset, which here means a sequence of timestamped simplices where each simplex is a set of nodes. Under the Drug Listing Act of 1972, the U.S. Food and Drug Administration releases information on all commercial drugs going through the regulation of the agency, forming the National Drug Code (NDC) Directory. In this dataset, each hyperedge corresponds to an NDC code for a drug, and the nodes are substances that make up the drug. Timestamps are in days and represent when the drug was first marketed. We restricted to hyperedges containing at most 25 nodes.
Statistics
Number of nodes: 5,311
Number of timestamped hyperedges: 112,405
Number of unique hyperedges: 10,025
Source of original data
Source: NDC-substances.
References
If you use this data, please cite the following paper:
Simplicial closure and higher-order link prediction. Austin R. Benson, Rediet Abebe, Michael T. Schaub, Ali Jadbabaie, and Jon Kleinberg. Proceedings of the National Academy of Sciences (PNAS), 2018.
数据集概述
本数据集为时序高阶网络数据集,此处指由带时间戳的单形(simplices)构成的序列,其中每个单形均为一组节点的集合。根据1972年《药品名录法案》(Drug Listing Act of 1972),美国食品药品监督管理局(U.S. Food and Drug Administration,FDA)会公开所有经过其监管的商用药品信息,形成国家药品代码(National Drug Code,NDC)名录。本数据集中,每条超边(hyperedge)对应一款药品的国家药品代码,节点则为构成该药品的物质成分。时间戳以天为单位,代表该药品首次上市的日期。本数据集仅保留节点数不超过25的超边。
统计信息
节点总数:5,311
带时间戳的超边总数:112,405
唯一超边总数:10,025
原始数据来源
来源:NDC-substances
参考文献
若使用本数据集,请引用以下论文:
《单形闭包与高阶链路预测》(Simplicial closure and higher-order link prediction),作者:Austin R. Benson、Rediet Abebe、Michael T. Schaub、Ali Jadbabaie及Jon Kleinberg,发表于《美国国家科学院院刊》(Proceedings of the National Academy of Sciences,PNAS),2018年。
创建时间:
2024-04-04



