MFFHA-DTI Dataset. Zhao et al.

Source: NIAID Data Ecosystem (indexed 2026-05-02)
Download link:
https://data.mendeley.com/datasets/9fw8c7hs7f
Description:
The MFFHA-DTI dataset contains three datasets: Human, C.elegans, and an interaction type dataset. The public Human and C.elegans datasets are used to predict the probability of drug-target interaction (DTI) and are widely applicable in drug research and development. Each record in these two datasets consists of a SMILES string for the drug, a protein sequence, and a binary label (1 or 0) indicating whether the pair interacts. Both contain highly reliable negative DTI samples and are balanced, with a positive-to-negative ratio of about 1:1. The newly constructed interaction type dataset is used to predict the specific type of drug-target interaction. Its data is filtered from the 2024-04 version of the "all drugs" dataset in the DrugBank database: for each drug, the SMILES string, the sequence of the single protein it interacts with, and the type of interaction between them are selected. The PDB file of each protein is then obtained from the AlphaFold DB website using the UniProt ID given in the "all drugs" dataset, and the structures are mapped back to the DrugBank records by that ID.
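The record layout described above (SMILES, protein sequence, 0/1 label) can be illustrated with a minimal Python sketch. It assumes one whitespace-separated record per line, which is a guess at the layout and not confirmed by the dataset page; the names `DtiRecord`, `parse_dti_lines`, and `positive_to_negative_ratio` are hypothetical, and the sample rows are made-up toy values, not taken from the dataset.

```python
from collections import Counter
from typing import Iterable, NamedTuple


class DtiRecord(NamedTuple):
    smiles: str    # drug structure as a SMILES string
    sequence: str  # target protein amino-acid sequence
    label: int     # 1 = interacts, 0 = does not interact


def parse_dti_lines(lines: Iterable[str]) -> list[DtiRecord]:
    """Parse 'SMILES SEQUENCE LABEL' lines (assumed layout), skipping malformed ones."""
    records = []
    for line in lines:
        parts = line.split()
        if len(parts) != 3 or parts[2] not in ("0", "1"):
            continue  # blank line or unexpected layout
        smiles, sequence, label = parts
        records.append(DtiRecord(smiles, sequence, int(label)))
    return records


def positive_to_negative_ratio(records: Iterable[DtiRecord]) -> float:
    """Return (#positive / #negative); about 1.0 for a balanced dataset."""
    counts = Counter(r.label for r in records)
    if counts[0] == 0:
        raise ValueError("no negative samples")
    return counts[1] / counts[0]


# Toy rows for illustration only (not real dataset entries).
sample = [
    "CCO MKTAYIAKQR 1",
    "CC(=O)O MKTAYIAKQR 0",
    "",
    "c1ccccc1 MSLLTEVETY 1",
    "CCN MSLLTEVETY 0",
]
records = parse_dti_lines(sample)
ratio = positive_to_negative_ratio(records)
```

Running the same ratio check on the real Human or C.elegans files is a quick way to verify the stated 1:1 balance, provided the parser is adjusted to the actual file layout.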
Created:
2025-07-25