Intraspecific genome SNP frequencies comparison
收藏DataONE2023-02-08 更新2024-06-08 收录
下载链接:
https://search.dataone.org/view/sha256:9194f2161e2b460e5f939f966a1af72c5fd43b9d3da1e2fd319e320bdc00c905
下载链接
链接失效反馈官方服务:
资源简介:
Genome sequence analyses can provide crucial for understanding the origin and spread of infectious diseases, population history, speciation, and taxonomy. In Class Agaricomycete where most mushroom-forming fungi belong, most species so far have been defined based on morphological, ecological, and/or molecular features, but there is no defined threshold for any type of features that can be applied across multiple genera, families, and orders. In this study, we investigated genome-wide single nucleotide polymorphism (SNP) frequencies within species to understand the patterns of variation within both the nuclear and mitochondrial genomes of the current whole-genome sequenced species. In total, our analyses included 398 and 106 published available nuclear and mitochondrial genomes of Agaricomycetes, respectively. The SNP frequencies among nuclear genomes within individual species ranged 0.00~7.69% while for the mitochondrial genome comparison, the intraspecific SNP frequencies ranged 0.00~4..., The assembled nuclear and mitochondrial (mt) genome data of Agaricomycetes were downloaded from the National Center of Biology Information (NCBI, https://www.ncbi.nlm.nih.gov/genome/?term=) and the Joint Genome Institute (JGI, https://mycocosm.jgi.doe.gov/mycocosm/home) genome database up to August 31, 2022. For each analyzed genome, the sequencing technology used, assembled genome size, sequencing read coverage depth, number of scaffolds and/or contigs, N50 (the minimum scaffold/contig length needed to cover 50% of the genome, L50 (the number of contigs required to reach N50), the mitogenome size and the related references were all retrieved when available. The species containing at least two nuclear or two mt genomes were selected for further analyses. , The genome-wide SNP analyses within individual species were determined by the alignment-based program MUMmer 3.23, with longer assemblies (larger genome and better-assembled genomes/fewer scaffolds) in each pairwise comparison serving as the reference for each analyzed species. Our alignments used the following specific commands: ââmum -pâ parameter for aligning each pair of assembled genomes and identifying overlapping regions between two profiles (maxgap=500, mincluster=100), followed by âdelta-filter -1â processing to filter out repeated comparisons, then âshow-snps -CHITrlâ to detect base substitutions. Insertions and deletions (InDels) in those overlapping regions were excluded from SNP frequency calculations.
创建时间:
2025-07-21



