cg2all datasets
Source: NIAID Data Ecosystem (indexed 2026-05-01)
Download link:
https://zenodo.org/record/8273738
Description:
Training, validation, and test sets used in the development of cg2all.
The "set.tar.gz" file contains several files, each listing PDB IDs:
targets.train.pdb.6k: training set, pdb.6k
targets.train.pdb.29k: training set, pdb.29k
targets.valid.pdb: validation set for both pdb.6k and pdb.29k
targets.test.pdb: test set for both pdb.6k and pdb.29k
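The list files above can be loaded straight from the archive to check how the splits relate to each other. The following is a minimal sketch, not part of the cg2all distribution. It assumes each list file is plain text with one entry per line and the PDB ID as the first whitespace-separated token; the record does not state this format. The helper names read_id_lists and split_overlap are hypothetical.

```python
import tarfile
from pathlib import PurePosixPath


def read_id_lists(archive_path):
    """Return {list file name: [PDB IDs]} for every regular file in the archive.

    Assumption: each list file is plain text, one entry per line, and the
    PDB ID is the first whitespace-separated token on the line.
    """
    id_lists = {}
    with tarfile.open(archive_path, "r:gz") as archive:
        for member in archive:
            if not member.isfile():
                continue  # skip directories and links
            text = archive.extractfile(member).read().decode("utf-8", errors="replace")
            ids = [line.split()[0] for line in text.splitlines() if line.strip()]
            # Key by base name so a leading directory inside the archive does not matter.
            id_lists[PurePosixPath(member.name).name] = ids
    return id_lists


def split_overlap(id_lists, first, second):
    """Return the set of IDs that appear in both named lists."""
    return set(id_lists[first]) & set(id_lists[second])
```

For example, split_overlap(read_id_lists("set.tar.gz"), "targets.train.pdb.6k", "targets.test.pdb") returns the IDs shared by the pdb.6k training list and the test list, which is a quick way to confirm that the splits do not leak into each other.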
Each "tgz" file contains two subdirectories: "original" and "augment". The "original" directory holds PDB files that were curated and cleaned up with the process_pdb.py script. The "augment" directory holds the same set of PDB files with different atomic coordinates; these were used to augment the original training data set with additional side-chain rotamer states. For details of the augmentation procedure, please refer to our paper.
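After extracting one of the "tgz" files, each curated structure can be matched with its augmented counterpart. The sketch below is illustrative only. It assumes that a file in "augment" has the same relative path as its partner in "original", which the description implies ("the same set of PDB files") but does not state outright. The function name pair_structures is hypothetical.

```python
from pathlib import Path


def _index_files(base):
    """Map each file's path relative to `base` onto its full path."""
    return {p.relative_to(base).as_posix(): p for p in base.rglob("*") if p.is_file()}


def pair_structures(root):
    """Pair files under root/original with files under root/augment.

    Assumption: matching files share the same relative path in both
    subdirectories. Returns (pairs, unmatched), where pairs is a list of
    (original_path, augment_path) tuples and unmatched lists the relative
    paths found in only one of the two subdirectories.
    """
    root = Path(root)
    originals = _index_files(root / "original")
    augmented = _index_files(root / "augment")
    shared = sorted(originals.keys() & augmented.keys())
    pairs = [(originals[name], augmented[name]) for name in shared]
    unmatched = sorted(originals.keys() ^ augmented.keys())
    return pairs, unmatched
```

A non-empty unmatched list means the naming assumption does not hold for that archive, and the pairing rule should be adjusted before the data is used.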
Created:
2023-08-23



