five

Intraspecific genome SNP frequencies comparison

收藏
DataONE2023-02-08 更新2024-06-08 收录
下载链接:
https://search.dataone.org/view/sha256:9194f2161e2b460e5f939f966a1af72c5fd43b9d3da1e2fd319e320bdc00c905
下载链接
链接失效反馈
官方服务:
资源简介:
Genome sequence analyses can provide crucial for understanding the origin and spread of infectious diseases, population history, speciation, and taxonomy. In Class Agaricomycete where most mushroom-forming fungi belong, most species so far have been defined based on morphological, ecological, and/or molecular features, but there is no defined threshold for any type of features that can be applied across multiple genera, families, and orders. In this study, we investigated genome-wide single nucleotide polymorphism (SNP) frequencies within species to understand the patterns of variation within both the nuclear and mitochondrial genomes of the current whole-genome sequenced species. In total, our analyses included 398 and 106 published available nuclear and mitochondrial genomes of Agaricomycetes, respectively. The SNP frequencies among nuclear genomes within individual species ranged 0.00~7.69% while for the mitochondrial genome comparison, the intraspecific SNP frequencies ranged 0.00~4..., The assembled nuclear and mitochondrial (mt) genome data of Agaricomycetes were downloaded from the National Center of Biology Information (NCBI, https://www.ncbi.nlm.nih.gov/genome/?term=) and the Joint Genome Institute (JGI, https://mycocosm.jgi.doe.gov/mycocosm/home) genome database up to August 31, 2022. For each analyzed genome, the sequencing technology used, assembled genome size, sequencing read coverage depth, number of scaffolds and/or contigs, N50 (the minimum scaffold/contig length needed to cover 50% of the genome, L50 (the number of contigs required to reach N50), the mitogenome size and the related references were all retrieved when available. The species containing at least two nuclear or two mt genomes were selected for further analyses. , The genome-wide SNP analyses within individual species were determined by the alignment-based program MUMmer 3.23, with longer assemblies (larger genome and better-assembled genomes/fewer scaffolds) in each pairwise comparison serving as the reference for each analyzed species. Our alignments used the following specific commands: “–mum -p” parameter for aligning each pair of assembled genomes and identifying overlapping regions between two profiles (maxgap=500, mincluster=100), followed by “delta-filter -1” processing to filter out repeated comparisons, then “show-snps -CHITrl” to detect base substitutions. Insertions and deletions (InDels) in those overlapping regions were excluded from SNP frequency calculations.
创建时间:
2025-07-21
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作