Kraken2 Human Pangenome Reference Consortium Release 2 database
收藏DataCite Commons2025-11-18 更新2026-02-09 收录
下载链接:
https://figshare.com/articles/dataset/Kraken2_Human_Pangenome_Reference_Consortium_Release_2_database/30647910/1
下载链接
链接失效反馈官方服务:
资源简介:
A kraken2 database built from the genome assemblies generated in the second release of the Human Pangenome Reference Consortium (https://projects.ensembl.org/hprc/). This archive contains the three files required by kraken2, hash.k2d, opts.k2d, and taxo.k2d, along with inspect.txt, which is obtained by running <code>kraken2-inspect</code> on the database.The genomes for this database were downloaded using the assembly summary text file included in this dataset and genome_updater.sh (v0.6.3; https://github.com/pirovc/genome_updater)<pre><code>genome_updater.sh -m -a -f "genomic.fna.gz" -t 8 -e "hprc.r2_assembly_summary.txt" -o HPRC_genomes/</code></pre>The python script prepare_kraken_fasta.py was then used to prepare the assemblies for use in kraken with the following command<pre><code>python prepare_kraken_fasta.py -r -T 9606 -o HPRC.fna HPRC_genomes/</code></pre>The database was then built with kraken2 (v2.1.6) using the following commands<pre><code>k2 download-taxonomy --db db</code><br><code>k2 add-to-library --db db --threads 8 --file genomes.fna --no-masking</code><br><code>k2 build --db db --no-masking --threads 16</code><br><code>k2 clean --db db</code><br><code>k2 inspect --db db > inspect.txt</code></pre>
提供机构:
figshare
创建时间:
2025-11-18



