extHomFam 2: large-scale benchmark for protein multiple sequence alignments
Source: NIAID Data Ecosystem (indexed 2026-03-13)
Download link:
https://zenodo.org/record/6524236
Description:
extHomFam 2 was constructed by combining Homstrad reference alignments (March 2020) with Pfam 33.1 complete families (NCBI variant). Homstrad entries with fewer than 3 reference sequences and those pointing to dead Pfam families were discarded. The resulting benchmark was divided into subsets according to the family size N:
subset    N range                   # families
small     [200, 10 000)             86
medium    [10 000, 40 000)          95
large     [40 000, 100 000)         83
xlarge    [100 000, 250 000)        67
huge      [250 000, 3 000 000)      62
The directories in the archive are named after the subsets, and the reference alignments are located in the 'ref' folder.
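The size ranges above are half-open intervals, so a family of exactly 10 000 sequences falls into medium, not small. As a minimal sketch of that rule, the Python function below maps a family size N to its subset name. The function name `subset_for_size` and the constants are hypothetical and are not part of the archive; the sketch assumes only the ranges listed in the description.

```python
from bisect import bisect_right
from typing import Optional

# Lower bounds of the half-open ranges [lo, hi) from the description.
# These names are illustrative, not taken from the archive.
LOWER_BOUNDS = [200, 10_000, 40_000, 100_000, 250_000]
SUBSET_NAMES = ["small", "medium", "large", "xlarge", "huge"]
UPPER_LIMIT = 3_000_000  # exclusive upper bound of the "huge" subset


def subset_for_size(n: int) -> Optional[str]:
    """Return the subset name for a family of n sequences.

    Returns None when n lies outside [200, 3 000 000).
    """
    if n < LOWER_BOUNDS[0] or n >= UPPER_LIMIT:
        return None
    # bisect_right counts how many lower bounds are <= n;
    # subtracting 1 gives the index of the range containing n.
    return SUBSET_NAMES[bisect_right(LOWER_BOUNDS, n) - 1]
```

For example, `subset_for_size(40_000)` yields `"large"` because the lower bound of each range is inclusive and the upper bound is exclusive.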
Created:
2022-05-11



