MFFHA-DTI Dataset. Zhao et al.
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://data.mendeley.com/datasets/9fw8c7hs7f
下载链接
链接失效反馈官方服务:
资源简介:
The MFFHA-DTI dataset contains three datasets, Human, C.elegans and interaction type dataset. The public datasets Human and C.elegans are used to predict the probability of drug target interaction. They are universal and effective in drug research and development. These two datasets are composed of smiles structural formula, protein sequence, and 1 and 0 Tags of whether they interact. They contain highly reliable negative DTI. In short, it is a balanced DTI dataset, in which the ratio of positive samples to negative samples (positive: negative) is about 1:1. The newly constructed dataset is used to predict the specific type of drug target interaction. The data is filtered from the 2024-04 version of all drugs dataset in the drugbank database. The smiles structural formula of the drug, the corresponding protein sequence of a single protein that reacts, and the type of interaction between them are selected. Then the pdb file of the corresponding protein is obtained from alphafolddb website according to the UniProt ID in the all drugs dataset, and then the data obtained from the drugbank database is mapped according to the ID.
创建时间:
2025-07-25



