dataset.zip
Source: NIAID Data Ecosystem (indexed 2026-03-11)
Download link:
https://figshare.com/articles/dataset/dataset_zip/11770902
Description:
Our final dataset comprises 491,018 class I MHC allele data points covering 161 different HLAs and 64,954 class II MHC allele data points covering 49 different HLAs. Peptide lengths range from 3 to 43 in both classes, while the amino acid sequence lengths of the HLAs range from 180 to 347 for class I and from 85 to 232 for class II.
Class I comprises 451,484 HLA-A*, 39,424 HLA-B*, and 110 HLA-C* data points, of which 379,783 are binding and 111,235 are non-binding. Similarly, class II comprises 64,926 HLA-DRB1*, 22 HLA-DRB3*, 4 HLA-DRB4*, and 2 HLA-DRB5* data points, of which 36,035 are binding and 28,919 are non-binding.
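The per-locus and per-label counts quoted in the description can be cross-checked against the stated class totals. The following is a minimal sketch of that arithmetic only: the counts are copied from the description, while the dictionary layout and the function names `check_counts` and `binding_fraction` are illustrative assumptions and do not reflect the actual file layout inside dataset.zip.

```python
# Counts copied from the dataset description; the structure is hypothetical.
CLASS_I = {
    "total": 491_018,
    "by_locus": {"HLA-A": 451_484, "HLA-B": 39_424, "HLA-C": 110},
    "by_label": {"binding": 379_783, "non-binding": 111_235},
}
CLASS_II = {
    "total": 64_954,
    "by_locus": {"HLA-DRB1": 64_926, "HLA-DRB3": 22, "HLA-DRB4": 4, "HLA-DRB5": 2},
    "by_label": {"binding": 36_035, "non-binding": 28_919},
}


def check_counts(summary):
    """Return True if both breakdowns add up to the stated total."""
    total = summary["total"]
    locus_ok = sum(summary["by_locus"].values()) == total
    label_ok = sum(summary["by_label"].values()) == total
    return locus_ok and label_ok


def binding_fraction(summary):
    """Fraction of data points labeled as binding."""
    return summary["by_label"]["binding"] / summary["total"]


if __name__ == "__main__":
    for name, summary in (("class I", CLASS_I), ("class II", CLASS_II)):
        print(name, "consistent:", check_counts(summary),
              "binding fraction: %.3f" % binding_fraction(summary))
```

Both breakdowns sum to the stated totals. The check also shows how skewed the data is: about 77% of class I points are binding compared with about 55% of class II, and HLA-A* accounts for the large majority of class I points.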
Created:
2020-01-30



