five

Datasets for training and for testing ligand-based reverse screening to predict drug targets.

收藏
NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://zenodo.org/record/7534174
下载链接
链接失效反馈
官方服务:
资源简介:
Bioactivity data were obtained from the ChEMBL (version 25) and the Reaxys® (version 03.2019) databases for training/screening and testing target prediction of small molecule drug by reverse screening. A short extract of the raw ChEMBL data for training is given in Supplementary Table 1 (Supp_Table_1.xlsx) to show three lines corresponding to an active, an inactive and an ‘gray area’ datapoints, respectively. Processed data are deposited here. The screening set file contains (CHEMBL-screeningset.csv), for each active compound, the standardized SMILES, the ChEMBLID, the number of experimental target(s) and their UniProt identifier(s). The test set file (Reaxys-testset-with300-SMILES.csv) contains, for each 364201 active compound, the ReaxysID, the number of experimental target(s) and the UniProt identifier(s). For Reaxys users, the chemical structure can be obtained through bulk request on the website. Access to www.reaxys.com and to Reaxys data can be obtained by contacting Elsevier directly. The first 300 entries display the standardized SMILES so that every reader can reproduce the results obtained by the reverse screening exercise. Further datasets can be obtained at www.swisstargetprediction.ch/download.php
创建时间:
2023-01-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作