Datasets: AFsample2 predicts multiple conformations and ensembles with AlphaFold2

Source: NIAID Data Ecosystem (indexed 2026-05-02)

Download link:

https://zenodo.org/record/14534087

Description:

Directory Overview

An overview of the directory structure, with a description of each folder and its contents. Extract the archives as follows:

    tar --use-compress-program=unzstd -xvf analysis_results.tar.zst
    tar --use-compress-program=unzstd -xvf input_datasets.tar.zst
    tar --use-compress-program=unzstd -xvf generated_models.tar.zst

1. analysis_results

Contains the results (.csv) of the ensemble analysis, together with the files required to generate the images for the manuscript using the included summarize.ipynb notebook (an updated notebook is available at https://github.com/iamysk/AFsample2/blob/main/notebooks/summarize.ipynb).

    tar --use-compress-program=unzstd -xvf analysis_results.tar.zst

    └── analysis_results
        ├── general
        ├── oc23
        │   ├── afsample2
        │   ├── SPEACH_AF
        │   ├── ...
        └── tp16
            ├── afsample2
            ├── SPEACH_AF
            ├── ...

2. generated_models

Stores the models generated during the project, organized by dataset name (e.g., oc23, tp16).

Inside generated_models:

oc23 and tp16: Subdirectories holding the models produced by each method employed in the study (AFcluster, AFsample, AFsample2, MSA subsampling, SPEACH_AF). All methods except AFcluster have been implemented in the AFsample2 code.

    tar --use-compress-program=unzstd -xvf generated_models.tar.zst

    └── generated_models
        ├── oc23
        │   ├── afsample2
        │   ├── SPEACH_AF
        │   ├── ...
        └── tp16
            ├── afsample2
            ├── SPEACH_AF
            ├── ...

3. input_datasets

Contains the raw input datasets used for analysis and model generation, organized by dataset name (e.g., oc23, tp16).

Inside input_datasets (oc23 and tp16):

fastas: Directory containing FASTA sequence files.
filtered_dict.pickle (in oc23 only): A Python pickle file with preprocessed or filtered data (PDB IDs and statistics for the states).
msas: Multiple Sequence Alignments (MSAs) in .pkl format, used as input for modeling.
pdbs: PDB structure files related to the datasets (all reference PDBs used in the study).

    tar --use-compress-program=unzstd -xvf input_datasets.tar.zst

    └── input_datasets
        ├── oc23
        │   ├── fastas
        │   ├── filtered_dict.pickle  # pdbids and stats for states
        │   ├── msas                  # in .pkl format
        │   └── pdbs                  # All reference pdbs used in the study
        └── tp16
            ├── fastas
            ├── msas
            └── pdbs

Citation

If you use this dataset, please cite the associated publications.
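As a starting point for exploring the extracted archives, the following is a minimal sketch that lists the method subdirectories for a dataset and reports the top-level shape of a pickle file. It assumes only the directory layout described above; the helper names list_methods and summarize_pickle are hypothetical and not part of the AFsample2 code. The internal schema of filtered_dict.pickle and of the .pkl MSA files is not documented here, so the sketch inspects the loaded object rather than assuming particular keys. Loading may also require any third-party packages whose objects were stored in the pickles.

```python
import pickle
from pathlib import Path


def list_methods(root, dataset):
    """Return the sorted names of the subdirectories under root/dataset."""
    dataset_dir = Path(root) / dataset
    return sorted(p.name for p in dataset_dir.iterdir() if p.is_dir())


def summarize_pickle(path):
    """Load a pickle file and describe its top-level object without assuming a schema."""
    # Only unpickle files from a source you trust: pickle can run code on load.
    with open(path, "rb") as handle:
        obj = pickle.load(handle)
    summary = {"type": type(obj).__name__}
    if hasattr(obj, "__len__"):
        summary["length"] = len(obj)
    if isinstance(obj, dict):
        # Show a few keys so the layout can be checked by hand.
        summary["sample_keys"] = list(obj)[:5]
    return summary


if __name__ == "__main__":
    # Paths follow the trees above; adjust to wherever the archives were extracted.
    models_root = Path("generated_models")
    for dataset in ("oc23", "tp16"):
        if (models_root / dataset).is_dir():
            print(dataset, list_methods(models_root, dataset))

    states = Path("input_datasets/oc23/filtered_dict.pickle")
    if states.is_file():
        print(summarize_pickle(states))
```

Run from the directory where the archives were extracted, the script prints whatever method folders it finds and a short description of the pickle contents; missing paths are skipped rather than raising an error.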

Created:

2025-03-03
