CheckMyBlob evaluation data set (TAMC)
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/1040850
下载链接
链接失效反馈官方服务:
资源简介:
A data set of ligands used to evaluate the CheckMyBlob method, described in the Kowiel et al. paper "Automatic recognition of ligands in electron density by machine learning methods".
This data set attempts to repeat the experimental setup from Terwilliger et al. described in "Ligand identification using electron-density map correlations". It consists of ligands from X-ray diffraction experiments with 6–150 non-H atoms. Connected PDB ligands were labeled as single alphabetically ordered strings of hetero-compound codes, whereas unknown species, water molecules, standard amino acids, and nucleotides were excluded. Finally, the data set was limited to 200 most popular ligands. The resulting data set consisted of 161,758 examples with individual ligand counts ranging from 36,535 examples for GOL (glycerol) to 114 for LMG (1,2-distearoyl-monogalactosyl-diglyceride).
For machine learning (classification) purposes, the target attribute is: res_name.
创建时间:
2023-08-08



