Auxillary Table S6
收藏NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://figshare.com/articles/dataset/Auxillary_Table_S6/14200175
下载链接
链接失效反馈官方服务:
资源简介:
A table containing statistics, accession numbers and other contextual data about the 12682 Metagenome Assembled Genomes of moderate and high-quality submitted to ENA (e.g. completeness estimated higher then 40% and redundancy lower then 5%). A table with the whole ~500.000 bins generated in this study can be found at the scilifelab-figshare doi:10.17044/scilifelab.13005311. Contains the fields:
•bin_id: the unique identifier of a MAG
•GC: GC-content of MAG
•coding_density: coding-density estimate, e.g. sum of the length of all predicted amino-acid sequences times 3 divided by the length of the genome (in bases)
•completeness: estimated completeness computed by CheckM
•contamination: estimated redundancy computed by CheckM
•strain_heterogeneity: computed by CheckM
•length: length in bases of the MAG
•nb_contigs: number of contigs in the MAG
•nb_proteins: number of predicted proteins encoded in the bin (for the predicted amino-acid sequences in the easy-access repository)
•gtdbtk_classification: classification by GTDBtk r89
•sourmash_classification: classification by sourmash using 2 databases, the GTDB (r89), and a database based on our GTDBtk classification
•taxonomy: conclusion of the two previous
•mOTU: membership to a metagenomic Operational Taxonomic Unit (mOTU), e.g. species level clustering of bins
•path: relative path a tar-ball containing nucleotide, CDS- and amino-acid sequences, GFF, eggNOGmapper, and sourmash-signature files. Relative to https://export.uppmax.uu.se/uppstore2018116/stratfreshdb/
•accession: SRA/ENA sample ID of the assembly
•genbank_accession: genbank accession number
创建时间:
2021-05-07



