Protein Information and Statistics
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://figshare.com/articles/dataset/Protein_Information_and_Statistics/25679964
下载链接
链接失效反馈官方服务:
资源简介:
Further, a compressed CSV file (protein_information_and_statistics.csv) contains valuable information regarding the protein sequences and models. The file comprises 42,158 entries, each corresponding to a protein sequence considered in the modeling process. It includes UniProt accession codes, gene names (and synonyms), amino-acid sequences, details about the model type available for each protein (AlphaFold 2 or downloaded from the AlphaFold database, OpenFold, ESMFold, homology model), and, if applicable, the ligands they include. Additionally, the file provides detailed information about each model's secondary structure as well as a statistical analysis of quality parameters, offering insights into the reliability of each model.
此外,一个名为protein_information_and_statistics.csv的压缩逗号分隔值(Comma-Separated Values,CSV)文件,包含了与蛋白质序列及建模模型相关的宝贵信息。该文件共计四万二千一百五十八条条目,每条对应建模流程中纳入的一条蛋白质序列,其内容涵盖UniProt登录号(UniProt accession codes)、基因名称(及同义词)、氨基酸序列、各蛋白质可用模型类型的详细信息(包括AlphaFold 2、从AlphaFold数据库下载的模型、OpenFold、ESMFold以及同源建模模型(homology model)),若适用还包含其所结合的配体信息;除此以外,该文件还提供了各模型的二级结构详细信息,以及质量参数的统计分析结果,可为评估各模型的可靠性提供参考依据。
创建时间:
2024-05-29



