P. nodorum WA pangenome: supplementary material

Name: P. nodorum WA pangenome: supplementary material
Creator: figshare
Published: 2022-09-09 03:42:35
License: 暂无描述

DataCite Commons2022-09-09 更新2024-07-29 收录

下载链接：

https://figshare.com/articles/dataset/P_nodorum_WA_pangenome_supplementary_material/13325915/3

下载链接

链接失效反馈

官方服务：

资源简介：

Supplementary material for the manuscript: "Novel effector candidates and large accessory genome revealed by population-level pan-genomic analysis of Parastagonospora nodorum" These captions are summarised, for full captions please see the paper. Figure 1: The structure and features of the Western Australian (WA) Parastagonospora nodorum population. Figure 2: A circos plot showing SNP density over each of the 23 chromosomes in the SN15 genome assembly. Figure 3. A circos plot showing the proportion of RIP-like (CA↔TA or TG↔TA) mutations over transition (C↔T or G↔A) mutations for each of the 23 chromosomes in the SN15 genome assembly. Figure 4. A circos plot showing each Parastagonospora nodorum genome assembly alignment coverage for each of the 23 chromosomes in P. nodorum SN15. Figure 5. Dispensable and multi-copy orthogroups for each isolate in the P. nodorum pan-genome. Supp. table 1.Additional published genomes used in this study. Supp. table 2.Summary of Illumina sequencing read contamination detection. Supp. table 3.Parameters used to filter short variants by quality, and statistics of variants in the filtered set. Supp. table 4.Population diversity statistics and results of STRUCTURE analysis. Supp. table 5.Genome assembly for all isolates sequenced in this study. Statistics were collected using BBtools stats and QUAST. Supp. table 6.Summaries statistics of transposable elements, rRNA and tRNA genes, and repeat annotations for each assembled genome. Supp. table 7.Summary statistics of gene predictions for each isolate. Numbers are provided for each prediction method. Supp. table 8.SNP, counts RIP-like SNP ratios, and genome assembly alignment coverage data used to plot circular heatmaps in figures 2, 3, and 4. Supp. table 9.Orthogroup counts for each isolate used to plot figure 5. Supp. table 10. Functional annotation, selection, presence absence data for each orthogroup. A single representative sequence for each orthogroup was selected based on membership in reference genomes (preferentially SN15 > SN2000 > SN4 > SN79 > any other isolate), then by minimum distance to median length within the orthogroup, breaking ties randomly. The full orthogroup annotation can be found at https://doi.org/10.6084/m9.figshare.12966971.v4. Supp. table 11. GO term and effector enrichment tests for predicted functions and groups of orthogroups. Supp. data 1.MultiQC reports of read trimming and quality control for Illumina sequencing reads. Supp. data 2.Boxplots showing short variant (SNP, insertion/deletion, Mixed) genotype quality (GQ) statistics for each isolate. Each chromosome in SN15 is shown on a separate page in the PDF. Supp. data 3. Violin plots showing short variant (SNP, insertion/deletion, Mixed) genotype read depth (DP) statistics for each isolate. Each chromosome in SN15 is shown on a separate page in the PDF. Supp. data 4. Bar plots showing amounts of missing short variant genotype information for each isolate. Each chromosome in SN15 is shown on a separate page in the PDF. Supp. data 5.SNP locus quality statistics visualised for each chromosome in SN15 on separate pages in the PDF. Supp. data 6.Insertion and Deletion (INDEL) locus quality statistics visualised for each chromosome in SN15 on separate pages in the PDF. Supp. data 7.Mixed variant (multi-nucleotide variations, or insertions/deletions with SNPs at the same locus) locus quality statistics visualised for each chromosome in SN15 on separate pages in the PDF. Supp. data 8.Kernel density estimate plots showing the distributions of short variant locus quality statistics. Supp. data 9.Maximum likelihood phylogenetic tree estimated from 45,194 SNPs using IQTree. The file is in Newick format. Clade confidence values show SH-aLRT and UFBoot support separated by ‘/’. Supp. data 10.MSA and trees of ToxA, 1, 3 CDS/codon-aligned regions from pan-genome, to support prevalence of RIP-like SNPs across pan-genome in confirmed effector loci Supp. data 11.Example dot plot alignments between scaffolds and chromosomes containing orthogroups in PAV clusters selected from figure 5. Supp. figure 1. Phylogeographic representation of the WA P. nodorum populations, with phylogeny generated from whole-genome SNP data relative to alignment to the SN15 reference genome, and yellow lines indicating the approximate location of sampling. Supp. figure 2. Tanglegram comparison of predicted SNP phylogeny with the SSR predicted tree from Phan et al. (2020). Supp. figure 3.Comparison of population cluster assignment between this study and as identified by Phan et al. (2020). Supp. figure 4.Numbers of isolates in clusters from each sampling location. Supp. figure 5. Numbers of isolates in clusters from each sampling year. Supp. figure 6.The first six principal components computed from bi-allelic SNP data plotted for each sampling location. Supp. figure 7. The first six principal components computed from bi-allelic SNP data plotted against each sampling year.

提供机构：

figshare

创建时间：

2022-02-01

5,000+

优质数据集

54 个

任务类型

进入经典数据集