five

Additional file 1 of Comparative pangenomics: analysis of 12 microbial pathogen pangenomes reveals conserved global structures of genetic and functional diversity

收藏
Figshare2022-01-05 更新2026-04-08 收录
下载链接:
https://springernature.figshare.com/articles/dataset/Additional_file_1_of_Comparative_pangenomics_analysis_of_12_microbial_pathogen_pangenomes_reveals_conserved_global_structures_of_genetic_and_functional_diversity/17870487/1
下载链接
链接失效反馈
官方服务:
资源简介:
Additional file 1: Figure S1. Phylogenetic tree of selected species and MLST subtype distributions. a) Phylogenetic tree constructed for representative genomes of each species using PATRIC’s Codon Tree service. Genomes are labeled by their name and PATRIC Genome ID. b) Distribution of MLST subtypes for each species’ genome collection. The relative abundance of the top 5 MLST subtypes, all other subtypes, and untyped genomes are shown per species. Figure S2. Evaluation of accuracy of Heaps’ Law at predicting pangenome size, with or without controlling from MLST. a) Example fit of Heaps’ Law to first half of genomes (unbalanced) or MLSTs (MLST balanced) and extrapolation to second half to evaluate pangenome size projection. b-c) Median mean absolute error (MAE) across Heaps’ Law fits for 100 random genome orderings, with or without MLST balancing, for each species in the fitting and extrapolation regions. Dotted lines indicate equal performance between the two methods. Figure S3. Gene frequency distributions by species. Figure S4. Fitted cumulative gene frequency distributions and corresponding core and unique gene frequency thresholds, by species. Observed distributions (solid blue), fitted functions (dashed orange), and the R2 and mean absolute errors (MAE) of the fits are shown. Fitted inflection points (black dot, dashed gray) and frequency thresholds corresponding to core and unique genes (dashed black) are also shown. Figure S5. COG functional group enrichment in the core, accessory, and unique genomes of 12 species. Heatmaps are colored by the log2 odds ratio (LOR) between each COG and the a) core, b) accessory, c) unique genome of each species. COGs are sorted by mean LOR across all species. LOR color scales are symmetric and identical for all plots; four values are outside of the color range: F x E. cloacae (LOR = − 7.5), Q x C. coli (LOR = − 6.0), and K x C. coli (LOR = − 6.9) for accessory genomes; F x E. faecium (LOR = − 5.3) for unique genomes. Starred cells correspond to statistically significant enrichments under Fisher’s Exact test with FWER
提供机构:
Hyun, Jason C.; Palsson, Bernhard O.; Monk, Jonathan M.
创建时间:
2022-01-05
二维码
社区交流群
二维码
科研交流群
商业服务