five

A collection of high-quality human assemblies

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/13948741
下载链接
链接失效反馈
官方服务:
资源简介:
A collection of high-quality human assemblies, including: T2T-CHM13 v2.0 analysis set with HG002 chrY and rCRS chrM GRCh38 no-alt analysis set with rCRS chrM CN1 v1.0.1 YAO v1.1 KSA001 v1.1.0 232 HPRC r2/v0.6 samples (464 assemblies; HG00272 has a ~50Mb inversion misassembly) Use AGC to extract indivual genomes and use ropebwt3 to query the FM-index: agc listset human472.agc   # list genomesagc getset human472.agc 200149_HG02523.pat > HG02523.pat.fa # extract one genomegzip -d human472.fmr.gz     # decompress the incremental indexropebwt3 build -i human472.fmr -do human472.fmd  # convert to a faster query formatgzip -d human472.fmd.ssa.gz  # decompress suffix array samplesecho CCAGGACCCCTGTCCAGTGTTAGACAGGAGCATGCAG | ropebwt3 sw -eN200 -Lm10 human472.fmd - Note: The HPRC assemblies are already available from GenBank. This repository only provides a convenient way to download them. However, these assemblies are not formally published. You may use them for algorithm development or performance evaluation. If you want to use the genomes for biological discovery, please contact HPRC.
创建时间:
2025-02-16
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作