five

Auxillary Table S7

收藏
Figshare2021-05-07 更新2026-04-08 收录
下载链接:
https://springernature.figshare.com/articles/dataset/Auxillary_Table_S7/14200172
下载链接
链接失效反馈
官方服务:
资源简介:
A table containing statistics, accession numbers and other contextual data about the Single cell Assembled Genomes submitted to ENA. A table with the whole ~500.000 bins generated in this study can be found at the scilifelab-figshare doi:10.17044/scilifelab.13005311. Contains the fields: • bin_id: the unique identifier of a MAG • GC: GC-content of MAG • coding_density: coding-density estimate, e.g. sum of the length of all predicted amino-acid sequences times 3 divided by the length of the genome (in bases) • completeness: estimated completeness computed by CheckM • contamination: estimated redundancy computed by CheckM • strain_heterogeneity: computed by CheckM • length: length in bases of the MAG • nb_contigs: number of contigs in the MAG • nb_proteins: number of predicted proteins encoded in the bin (for the predicted amino-acid sequences in the easy-access repository) • gtdbtk_classification: classification by GTDBtk r89 • sourmash_classification: classification by sourmash using 2 databases, the GTDB (r89), and a database based on our GTDBtk classification • taxonomy: conclusion of the two previous • mOTU: membership to a metagenomic Operational Taxonomic Unit (mOTU), e.g. species level clustering of bins • path: relative path a tar-ball containing nucleotide, CDS- and amino-acid sequences, GFF, eggNOGmapper, and sourmash-signature files. Relative to https://export.uppmax.uu.se/uppstore2018116/stratfreshdb/ • accession: SRA/ENA sample ID of the assembly • genbank_accession: genbank accession number
提供机构:
Leyden Fernandez Vidal; Sarahi L. Garcia
创建时间:
2021-05-07
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作