five

TeratoDB

收藏
NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://figshare.com/articles/dataset/TeratoDB/7880093
下载链接
链接失效反馈
官方服务:
资源简介:
These files contain a set of 585 molecules classified for teratogenicity, with structural and mechanism of action data. The dataset has been balanced as much as possible with the data available in the scientific literature and public repositories like the EPA's IRIS and the NLM's TOXNET Hazardous Substance Data Bank. The goal of this dataset is to collect the available data in a format that will allow for easy prototyping of machine learning applications for teratogenic toxicity. The file 'drug_data.xlsx' contains the list of molecules with an assigned teratogenicity category (1 for teratogenic agents, -1 for non-teratogenic agents). It also contains ChEMBL, DrugBank and PubChem identifiers, when available, and a SMILES representation of the molecule. This list has been curated by hand. The file 'drug_uniprot_targets.xlsx', contains 2290 drug-target relationships between the 585 drugs from the previous file and 757 molecular targets that were found to be annotated in ChEMBL and DrugBank. The files used to process said data have been included in 'target_processing.zip'. Uniprot identifiers were used for all of the targets. Drug-target relationships found in both databases were included only once. Information about the Mechanism of Action was included when available. Finally, the file called 'target_list.xlsx' contains the list of the unique 758 protein targets found in any or both of the databases. They include Uniprot and ChEMBL identifiers, and a description of said targets when available.
创建时间:
2019-04-17
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作