five

core-genome tree of 882 assembled genomes

收藏
Figshare2022-07-18 更新2026-04-08 收录
下载链接:
https://figshare.com/articles/dataset/core-genome_tree_of_882_assembled_genomes/16611823/1
下载链接
链接失效反馈
官方服务:
资源简介:
Pantagruel (Lassalle <i>et al.,</i> 2019) was used to infer a maximum-likelihood (ML) “core genome tree” from the concatenated alignments of 291 core-genome genes extracted from the 889 assembled <i>V. cholerae</i> genomes using the “-S” option and otherwise default parameters. The ML tree was computed from the resulting 37,170 polymorphic positions using RAxML v8.2.11 (model ASC_GTRGAMMAX using Stamatakis’ ascertainment bias correction; one starting parsimony tree; 200 rapid bootstraps for estimating branch supports). Seven genomes were removed from the tree as they were identified as contaminated genome assemblies based on the Pantagruel pangenome analysis, for a final total of 882 genomes, presented in the trees with the 'nocontam' tag in the file name.Core-genome gene families were selected as those resent in at least 884/889 genomes i.e. absent in maximum ~0.5% of the genomes; locus_tags of representative genes in genome assembly 33224_2#41 are listed with their predicted protein function in file 'pseudo-core-884-unicopy_families.representative_product.tab'.<br>the full concatenated alignment is given in 'pseudo-core-884-unicopy_concat_cds_0-genomes_contextallyemen2019.aln.gz'<br>the same alignment reduced to its SNP positions is given in 'pseudo-core-884-unicopy_concat_cds_0-genomes_contextallyemen2019.snp.aln.gz'<br>
提供机构:
Lassalle, Florent
创建时间:
2022-07-18
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作