five

MetalHawk: Enhanced Classification Of Metal Coordination Geometries by Artificial Neural Networks

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/7955751
下载链接
链接失效反馈
官方服务:
资源简介:
The data used to train and validate the CSD-NN and PDB-NN models. These neural networks built in Scikit-learn and are trained to recognize the geometry and coordination number of a metal site from the geometric features including distances and angles of the six atoms closest to the metal. The sites are stored as .pdb files composed of all atoms falling within 10 angstroms of the central metal atom and they include both metal complexes deposited in the Cambridge Structural Database (CSD, Version 5.42-5.43) and bioinorganic sites deposited in the Protein Data Bank (PDB), retrieved through the MetalPDB interface (up to the end of 2018). The sites are divided in seven geometry classes: linear (LIN), trigonal planar (TRI), tetrahedral (TET), square planar (SPL), square pyramidal (SQP), trigonal bipyramidal (TBP) and octahedral (OCT). The number of metal sites in each file is the following: CSD_dataset_pdbs.zip - 110K  CSD_validation_dataset_pdbs.zip - 1369 PDB_dataset_pdbs.zip file - 2960 PDB_validation_dataset_pdbs.zip - 106 The file images_and_data_analysis.zip contains all data and code required to replicate the figures shown in the paper. The benchmark_fp.zip file contains code for the benchmark of Metalhawk and its comparison to Findgeo, another tool for coordination geometry classification.
创建时间:
2023-09-29
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作