MetaHIT "error-free" contigs from MetaBAT
Source: NIAID Data Ecosystem (indexed 2026-05-02)
Download link:
https://figshare.com/articles/dataset/MetaHIT_error-free_contigs_from_MetaBAT/27933807
Description:
This dataset comes from the MetaHIT "error-free" contigs produced for MetaBAT [1]. It was derived from raw reads obtained through 264 human gut sequencing runs conducted by the MetaHIT consortium [2]. The reads were mapped to 290 known genomes from the NCBI database.
To create the contigs, the reference genomes were shredded, yielding 41 to 1,617 contigs per genome. The contigs overlap by 31 bases and range in length from 2.5 kbp to 64.5 kbp. They are considered "error-free" because they were taken directly from known genomes rather than constructed by an assembly algorithm. We removed contigs shorter than 2.5 kbp, leaving a total of 177,146 contigs.
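The shredding step described above can be illustrated with a minimal sketch. This is not the MetaBAT procedure itself: the function name `shred`, the uniform random choice of fragment length, and the fixed seed are assumptions made for illustration. Only the 31-base overlap, the 2.5 kbp to 64.5 kbp length range, and the removal of fragments shorter than 2.5 kbp come from the description.

```python
import random


def shred(sequence, min_len=2500, max_len=64500, overlap=31, seed=0):
    """Cut `sequence` into consecutive fragments that overlap by `overlap` bases.

    Hypothetical helper: fragment lengths are drawn uniformly from
    [min_len, max_len], which is an assumption, not the documented method.
    """
    rng = random.Random(seed)
    contigs = []
    start = 0
    while start < len(sequence):
        length = rng.randint(min_len, max_len)
        fragment = sequence[start:start + length]
        # Mirror the length filter: drop fragments shorter than min_len
        # (only the truncated fragment at the end of the genome can be).
        if len(fragment) >= min_len:
            contigs.append(fragment)
        if start + length >= len(sequence):
            break
        # The next fragment begins `overlap` bases before this one ends.
        start += length - overlap
    return contigs
```

Calling `shred(genome)` on a reference sequence held as a string returns a list of fragments in genome order, where each fragment shares its last 31 bases with the start of the next one.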
For more details, see the MetaBAT wiki: https://bitbucket.org/berkeleylab/metabat/wiki/Home.
[1] Kang, Dongwan D., et al. "MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities." PeerJ 3 (2015): e1165.
[2] Ehrlich, S. Dusko, and MetaHIT Consortium. "MetaHIT: The European Union Project on metagenomics of the human intestinal tract." Metagenomics of the human body (2011): 307-316.
Created:
2024-11-30



