
MBC and ECBL Libraries: outstanding tools for drug discovery

Source: NIAID Data Ecosystem (indexed 2026-05-01)
Download link:
https://zenodo.org/record/7881097
Description:
UPDATE. New in this revision: Python scripts to process the DBs and calculate the percentage of molecules that pass the Veber and Ghose filters. Two new DBs were also added and included in the analysis.

Data and scripts to reproduce all the graphics reported in the manuscript (MS) entitled "MBC and ECBL Libraries: outstanding tools for drug discovery".

List of analyzed DBs:
MBC2016 (Total entries: 1,096 compounds; 7.39% excluded from properties analysis due to QikProp failure)
MBC2022 (Total entries: 2,577 compounds; 3.14% excluded from properties analysis due to QikProp failure)
ECBL (Total entries: 101,021 compounds; 0.20% excluded from properties analysis due to QikProp failure)
ChEMBL v.31 (Total entries: 1,908,325 compounds; 2.97% excluded from properties analysis due to QikProp failure)
DrugBank v.5.0 (Total entries: 10,981 compounds; 4.13% excluded from properties analysis due to QikProp failure)
ZINC20 (Total entries: 10,723,360 compounds; 0.61% excluded from properties analysis due to QikProp failure)
NuBBE (Total entries: 2,223 compounds) (NEW)
Approved drugs (Total entries: 3,140 compounds) (NEW)

Files:
QikProp_properties.docx: Word document containing the full list of QikProp properties calculated for each analyzed DB.
DATA_comparison.xlsx: Excel file containing the data used to reproduce the plots in Figure 4 of the MS.
Murcko_scaffold_percentages: distribution (%) of the 50 most populated Murcko scaffolds for MBC2016, MBC2022 and ECBL.
Murcko_scaffolds_comparison: distribution (count) of the first 94 common Murcko scaffolds for MBC2016, MBC2022 and ECBL.
QikProp properties for all the analyzed DBs (8 files; CSV format).
SMILES codes for all the analyzed DBs (8 files; SMI format).
joinplots.py: Python script to generate the 2D plots in Figure 2 of the MS.
fingerprint_similarity.py: Python script to compute and plot the Tanimoto similarities in Figure 3 of the MS.
calc_kde.py: Python script to run the kernel density analysis reported in Figure 5 of the MS.
Veber_filter.py: Python script to generate the data presented in Table 1 of the MS. (NEW)
Ghose filter.py: Python script to generate the data presented in Table 1 of the MS. (NEW)
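
To illustrate the kind of calculation the Veber and Ghose filter scripts perform, here is a minimal sketch of how a pass percentage could be computed from a table of precomputed properties. It is not the authors' code. The column names (rotor, PSA, MW, logP, MR, atoms) and the four sample rows are hypothetical and invented for illustration; the headers in the actual QikProp CSV files may differ. The thresholds are the commonly cited ones: Veber requires at most 10 rotatable bonds and a polar surface area of at most 140 Å²; Ghose requires a molecular weight of 160 to 480, logP of -0.4 to 5.6, molar refractivity of 40 to 130, and 20 to 70 atoms.

```python
import csv
import io

# Hypothetical column names; adjust to the headers of the real property files.
COL_ROTOR, COL_PSA = "rotor", "PSA"
COL_MW, COL_LOGP, COL_MR, COL_ATOMS = "MW", "logP", "MR", "atoms"

# Invented sample rows for illustration only (not taken from the dataset).
SAMPLE = """name,rotor,PSA,MW,logP,MR,atoms
mol_a,4,75.0,320.4,2.1,88.0,42
mol_b,12,95.0,510.7,4.8,140.2,75
mol_c,6,150.0,250.3,1.0,65.0,30
mol_d,3,60.0,150.1,0.5,38.0,18
"""


def passes_veber(row):
    """Veber: <= 10 rotatable bonds and polar surface area <= 140 A^2."""
    return float(row[COL_ROTOR]) <= 10 and float(row[COL_PSA]) <= 140.0


def passes_ghose(row):
    """Ghose: MW 160-480, logP -0.4 to 5.6, MR 40-130, 20-70 atoms."""
    return (
        160.0 <= float(row[COL_MW]) <= 480.0
        and -0.4 <= float(row[COL_LOGP]) <= 5.6
        and 40.0 <= float(row[COL_MR]) <= 130.0
        and 20 <= float(row[COL_ATOMS]) <= 70
    )


def percent_passing(rows, predicate):
    """Return the percentage of rows for which predicate(row) is true."""
    rows = list(rows)
    if not rows:
        return 0.0
    return 100.0 * sum(1 for row in rows if predicate(row)) / len(rows)


# Parse the in-memory sample; for a real file, pass an open file to DictReader.
rows = list(csv.DictReader(io.StringIO(SAMPLE)))
print(f"Veber pass rate: {percent_passing(rows, passes_veber):.1f}%")
print(f"Ghose pass rate: {percent_passing(rows, passes_ghose):.1f}%")
```

Keeping each filter as a separate predicate makes it easy to report the two percentages side by side, and to skip rows for which property calculation failed before computing the rate.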
Created:
2023-08-04