In silico mock communities for evaluation of taxonomic profilers across prokaryotes and viruses
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/10777403
下载链接
链接失效反馈官方服务:
资源简介:
In silico mock communities generated with CAMISIM for benchmarking the performance of taxonomic profilers across prokaryotic (50 communities), eukaryotic (30 communities), and viral communities (10 communities) of the human microbiome. Metagenomes were generated using CAMISIM (Fritz et al., 2019), which simulates 2.1 Gb of Illumina 2 ×150 bp paired end reads with the default HiSeq 2500 error profile and a mean insert size of 200 bp. To assess profiling performance for a range of sequencing depths, the 50 in silico metagenomes were also rarefied with seqtk (-s100) to sequencing depths of 20, 5, 2, 1, 0.5, 0.25 and 0.1 million read pairs. Counts are provided for rarefied metagenomes.
Prokaryotic communitiesFor prokaryotic benchmarking, 10 body site-representative prokaryotic metagenomes were simulated for each of the following five body sites: adult gut, infant gut, oral, skin, and vagina. Genome accession ids for prokaryotic species found in each human body site were identified from published literature (Bäckhed et al., 2015; Proctor et al., 2019; Saheb Kashaf et al., 2021).
Adult Gut: pro_gut_adult.zipInfant Gut: pro_gut_infant.zipOral: pro_oral.zipSkin: pro_skin_1.zip, pro_skin_2.zip, pro_skin_3.zipVaginal: pro_vaginal.zip
Downsized counts:
Eukaryotic communities30 eukaryotic in silico metagenomes comprising up to 200 randomly sampled genomes from a set of 113 eukaryotic species (See Supplementary Table 2 from the paper) corresponding to the eukaryotic species within both CHAMP and MetaPhlAn 4 (Blanco-Míguez et al., 2023) databases.
Eukaryotic data is deposited here: doi: 10.5281/zenodo.12090449
Viral communities10 viral communities were simulated with 95% of the reads from bacteria and 5% of the reads originating from phages. Each community consisted of 200 randomly selected bacterial genomes from GTDB with species-level annotation and 200 viral genomes from the Gut Phage Database (GPD, Camarillo-Guerrero et al., 2021).
Counts: phage_communities_counts.zipFastQ, forward reads: camisimu_[1-10].fq.1.gzFastQ, reverse reads: camisimu_[1-10].fq.2.gz
References
Bäckhed, F., Roswall, J., Peng, Y., Feng, Q., Jia, H., Kovatcheva-Datchary, P., et al. (2015). Dynamics and Stabilization of the Human Gut Microbiome during the First Year of Life. Cell Host Microbe 17, 690–703. doi: 10.1016/J.CHOM.2015.04.004
Blanco-Míguez, A., Beghini, F., Cumbo, F., McIver, L. J., Thompson, K. N., Zolfo, M., et al. (2023). Extending and improving metagenomic taxonomic profiling with uncharacterized species using MetaPhlAn 4. Nature Biotechnology 2023 41:11 41, 1633–1644. doi: 10.1038/s41587-023-01688-w
Camarillo-Guerrero, L. F., Almeida, A., Rangel-Pineros, G., Finn, R. D., and Lawley, T. D. (2021). Massive expansion of human gut bacteriophage diversity. Cell 184, 1098. doi: 10.1016/J.CELL.2021.01.029
Fritz, A., Hofmann, P., Majda, S., Dahms, E., Dröge, J., Fiedler, J., et al. (2019). CAMISIM: Simulating metagenomes and microbial communities. Microbiome 7, 1–12. doi: 10.1186/S40168-019-0633-6/FIGURES/5
Proctor, L. (2019). Priorities for the next 10 years of human microbiome research. Nature 2021 569:7758 569, 623–625. doi: 10.1038/d41586-019-01654-0
Saheb Kashaf, S., Proctor, D. M., Deming, C., Saary, P., Hölzer, M., Mullikin, J., et al. (2021). Integrating cultivation and metagenomics for a multi-kingdom view of skin microbiome diversity and functions. Nature Microbiology 2021 7:1 7, 169–179. doi: 10.1038/s41564-021-01011-w
创建时间:
2024-06-24



