five

Kraken2 Human Pangenome Reference Consortium database

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/8339731
下载链接
链接失效反馈
官方服务:
资源简介:
A kraken2 database built from the genome assemblies used by the Human Pangenome Reference Consortium (https://projects.ensembl.org/hprc/). This archive contains the three files required by kraken2, hash.k2d, opts.k2d, and taxo.k2d, along with inspect.txt, which is obtained by running kraken2-inspect on the database, ktaxonomy.tsv, which contains the taxonomy information of the database (obtained by running https://github.com/jenniferlu717/KrakenTools#make_ktaxonomypy). The genomes for this database were downloaded using the assembly summary text file included in this dataset and genome_updater.sh (v0.6.3; https://github.com/pirovc/genome_updater) genome_updater.sh -m -a -f "genomic.fna.gz" -t 8 -e "hprc_assembly_summary.txt" -o HPRC_genomes/ The python script prepare_kraken_fasta.py was then used to prepare the assemblies for use in kraken with the following command python prepare_kraken_fasta.py -r -T 9606 -o HPRC.fna HPRC_genomes/ The database was then built with kraken2 using the following commands kraken2-build --download-taxonomy --db db/ kraken2-build --add-to-library HPRC.fna --db db/ kraken2-build --build --db db/ --threads 16
创建时间:
2024-02-15
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作