Cerberus reference data v0.1.0 — masked T2T-CHM13v2.0 + IPD-IMGT/HLA, Kraken2 GDPR DB, auxiliary k-mer references
收藏Zenodo2026-05-17 更新2026-05-26 收录
下载链接:
https://zenodo.org/doi/10.5281/zenodo.20258068
下载链接
链接失效反馈官方服务:
资源简介:
Reference data bundle for Cerberus, a three-headed host-removal pipeline for metagenomic data.
Contents:
masked_t2t_hla.mmi — minimap2 index of T2T-CHM13v2.0 + IPD-IMGT/HLA, low-entropy and viral-homology masked.
masked_t2t_hla_bt2.tar.zst — bowtie2 index of the same reference.
kraken2_gdpr_compact.tar.zst — compact Kraken2 database covering human (T2T-CHM13v2.0), chimpanzee, gorilla, mouse, and rat for GDPR-pass host scrubbing.
aux_refs.fa.gz — Human host-decoy ncRNA (rRNA, snRNA, snoRNA, miRNA, Y_RNA, vaultRNA) + mitochondrion (NC_012920.1).
human_k27.fa.gz — Full T2T-CHM13v2.0 used as bbduk k-mer reference in Cerberus's GDPR pass (orthogonal k=31 mechanism alongside Kraken2).
Source build scripts: scripts/build_refs/.
提供机构:
Zenodo
创建时间:
2026-05-17



