five

Dataset of AlphaFold's internal representations of 4,581 proteins relevant for drug discovery

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/10671260
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains the outputs of the AlphaFold model for 4,581 proteins that are relevant targets in drug discovery. More information on the dataset can be found at the following repository: Dataset structure: ↓ data/* -> main data directory ↓ data/PID/* -> data of a single protein of length L Filename Description Tensor shape Lightweight single.npy ( s i ) evoformer single representation [L x 384] ✔️ structure.npy ( a i ) output of the last layer of structure module [L x 384] ✔️ msa.npy*** ( m s i ) processed MSA representation [N x L x 256]   pair.npy*** ( z i j ) evoformer pair representation [L x L x 128]   PID.pdb 3D protein structure prediction   ✔️ PID_unrelaxed.pdb 3D protein structure prediction w/o relaxation step (D)   ✔️ confidence.npy* confidence in structure prediction (0-100) 1 ✔️ plldt.npy* confidence in structure prediction per residue [L] ✔️ PID.fasta protein amino acid sequence and metadata   ✔️ timings.json Processing log   ✔️ ↓ data/PID2/* -> data of protein #2 ... *Note: L: sequence length, N: number of aligned sequences via MSA.
创建时间:
2024-11-14
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作