
Accession numbers for reference sequences used in Kraken database build

Source: Zenodo. Updated 2025-12-02. Indexed 2026-05-26.
Download link:
https://zenodo.org/doi/10.5281/zenodo.17778887
Description:
This metadata file lists every reference sequence accession number used to build the custom Kraken 2 database for the paper “Airborne eDNA captures three decades of ecosystem biodiversity”. It is the seqid2taxid.map file that Kraken 2 writes during the database build, and it maps each sequence accession to its taxonomic identifier (TaxID). Entries of the form “kraken:taxid|TaxID|ACCESSION” come from custom substrings added to the FASTA headers of sequences that are absent from the “nucl_gb.accession2taxid” and “nucl_wgs.accession2taxid” files (both part of the NCBI taxonomy), which Kraken searches to obtain the TaxID of each input sequence. The added substring lets Kraken assign these sequences to the correct TaxID, and the sequence accession is the last part of that substring. The decompressed file is approximately 24 GB and contains ~700 million accessions.
Provider:
Zenodo
Created:
2025-12-02
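
As an illustration of the entry format given in the description, the sketch below recovers the plain accession and TaxID from individual map lines. It assumes each line holds a sequence ID and a TaxID separated by a tab, which is not stated on this page. The function names and the sample accessions are hypothetical. Lines are processed one at a time because the file is too large to load into memory comfortably.

```python
from typing import Iterable, Iterator, Tuple

# Prefix of the custom header substring described above:
# kraken:taxid|TaxID|ACCESSION
CUSTOM_PREFIX = "kraken:taxid|"


def parse_map_line(line: str) -> Tuple[str, int]:
    """Return (accession, taxid) for one map line.

    Assumes two tab-separated columns: sequence ID, then TaxID.
    """
    seqid, taxid = line.rstrip("\n").rsplit("\t", 1)
    if seqid.startswith(CUSTOM_PREFIX):
        # The accession is the last "|"-separated part of the substring.
        accession = seqid.split("|", 2)[2]
    else:
        # Ordinary entries already carry the bare accession.
        accession = seqid
    return accession, int(taxid)


def iter_records(lines: Iterable[str]) -> Iterator[Tuple[str, int]]:
    """Lazily parse lines, skipping blank ones, without loading the file."""
    for line in lines:
        if line.strip():
            yield parse_map_line(line)


# Made-up sample lines, not taken from the dataset.
sample = [
    "AB000001.1\t9606\n",
    "kraken:taxid|9606|XY000002.1\t9606\n",
]
records = list(iter_records(sample))
```

To run this over the real file, pass an open file handle (for example `open("seqid2taxid.map")`) to `iter_records` in place of the sample list.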