
MD simulations and ML dataset of HLA-EpiCheck epitope predictor tool

DataCite Commons | Updated: 2025-05-16 | Indexed: 2025-04-16
Download link:
https://entrepot.recherche.data.gouv.fr/citation?persistentId=doi:10.57745/GXZHH8
Description:
This dataset contains all the data used to implement the B-cell epitope predictor tool called HLA-EpiCheck (see https://doi.org/10.1101/2023.12.18.572133).

CONTENTS:
- pre-patches: Directory containing the computed pre-patches. A pre-patch is the set of residues within a given distance of a residue. Patches are subsequently generated by keeping only the solvent-accessible residues. Files are organized by locus and by antigen. Each file contains the pre-patches associated with a given residue, computed for each frame considered in the trajectory.
  + _patches_resid__size_.txt: Each line in the file corresponds to a pre-patch of a given frame. Line format : : Residue numbering is the same as in the PDB files.
- trajectories: Directory containing the MD data. Files are organized by locus and by antigen.
  + .dcd: 10 ns MD trajectory comprising 1000 frames. Water molecules were removed.
  + .psf: Topology file of the .dcd trajectory.
  + .pdb: Starting structure of the MD simulation.
- training_set_size_15.csv: Training set used to train HLA-EpiCheck.
- test_set_size_15.csv: Test set used to evaluate HLA-EpiCheck.
- table_patch_ID_antigen_residue.csv: Table containing the antigen and central residue associated with each patch.
- model_ERF_radius_15.pkl: ML model of HLA-EpiCheck in pickle format. Pickle version 4.0 used.
- descriptors_eplets_non-confirmed.csv: Descriptors of the non-confirmed residue patches.
- preds_non_confirmed_DQ.csv: HLA-EpiCheck predictions on the non-confirmed residue patches of eplets from locus DQ.
- PDB_modeled_structures.txt: List of antigens modeled from a PDB structure with the corresponding PDB entry.
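
The relationship between a pre-patch and a patch described above can be illustrated with a minimal Python sketch. It is not the HLA-EpiCheck implementation: the function names, the use of one representative coordinate per residue, the accessibility scores, and the 0.2 threshold are all hypothetical, and the radius of 15.0 only echoes the `size_15` / `radius_15` in the file names (its unit and exact distance definition are assumptions).

```python
from math import dist


def compute_pre_patch(coords, center, radius):
    """Return the sorted residue IDs lying within `radius` of residue `center`.

    `coords` maps residue ID -> (x, y, z) of one representative point per
    residue (an assumption; the dataset may define the distance differently).
    """
    origin = coords[center]
    return sorted(r for r, xyz in coords.items() if dist(origin, xyz) <= radius)


def compute_patch(pre_patch, accessibility, min_accessibility):
    """Keep only the pre-patch residues considered solvent-accessible.

    `accessibility` maps residue ID -> a hypothetical accessibility score;
    residues missing from the mapping are treated as buried.
    """
    return [r for r in pre_patch if accessibility.get(r, 0.0) >= min_accessibility]


# Toy data for one frame (made-up values, not taken from the dataset).
coords = {
    1: (0.0, 0.0, 0.0),
    2: (5.0, 0.0, 0.0),
    3: (0.0, 14.0, 0.0),
    4: (20.0, 0.0, 0.0),
}
accessibility = {1: 0.6, 2: 0.05, 3: 0.4, 4: 0.9}

pre_patch = compute_pre_patch(coords, center=1, radius=15.0)
patch = compute_patch(pre_patch, accessibility, min_accessibility=0.2)

print(pre_patch)  # [1, 2, 3]
print(patch)      # [1, 3]
```

In this toy frame, residue 4 is excluded from the pre-patch because it lies beyond the radius, and residue 2 is dropped from the patch because it is buried. Repeating the computation for each frame of a trajectory would give one pre-patch per frame, which matches the one-line-per-frame layout of the pre-patch files.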
Provider:
Recherche Data Gouv
Created:
2023-12-19