five

Auxillary Table S6

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://figshare.com/articles/dataset/Auxillary_Table_S6/14200175
下载链接
链接失效反馈
官方服务:
资源简介:
A table containing statistics, accession numbers and other contextual data about the 12682 Metagenome Assembled Genomes of moderate and high-quality submitted to ENA (e.g. completeness estimated higher then 40% and redundancy lower then 5%). A table with the whole ~500.000 bins generated in this study can be found at the scilifelab-figshare doi:10.17044/scilifelab.13005311. Contains the fields: •bin_id: the unique identifier of a MAG •GC: GC-content of MAG •coding_density: coding-density estimate, e.g. sum of the length of all predicted amino-acid sequences times 3 divided by the length of the genome (in bases) •completeness: estimated completeness computed by CheckM •contamination: estimated redundancy computed by CheckM •strain_heterogeneity: computed by CheckM •length: length in bases of the MAG •nb_contigs: number of contigs in the MAG •nb_proteins: number of predicted proteins encoded in the bin (for the predicted amino-acid sequences in the easy-access repository) •gtdbtk_classification: classification by GTDBtk r89 •sourmash_classification: classification by sourmash using 2 databases, the GTDB (r89), and a database based on our GTDBtk classification •taxonomy: conclusion of the two previous •mOTU: membership to a metagenomic Operational Taxonomic Unit (mOTU), e.g. species level clustering of bins •path: relative path a tar-ball containing nucleotide, CDS- and amino-acid sequences, GFF, eggNOGmapper, and sourmash-signature files. Relative to https://export.uppmax.uu.se/uppstore2018116/stratfreshdb/ •accession: SRA/ENA sample ID of the assembly •genbank_accession: genbank accession number
创建时间:
2021-05-07
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作