five

Faster‐haplodiploid evolution under divergence‐with‐gene‐flow: Simulations and empirical data from pine‐feeding hymenopterans

收藏
Mendeley Data2024-04-13 更新2024-06-28 收录
下载链接:
https://datadryad.org/stash/dataset/doi:10.5061/dryad.fbg79cnwx
下载链接
链接失效反馈
官方服务:
资源简介:
DNA was extracted and ddRAD libraries were prepared for 23 Neodiprion pinetum larvae and 44 N. lecontei larvae collected from multiple locations in Kentucky, as well as 4 interspecific hybrids and an additional 18 N. lecontei samples from an allopatric population in Michigan. These libraries were sequenced using 150-bp paired end reads on an Illumina HiSeq 4000. We also collected whole-genome resequncing data from a N. virginiana sample to be used as an outgroup in demographic analyses. Neodiprion sequencing reads are available via the NCBI SRA, accession numbers: SAMN23893940-SAMN23893944, SAMN23893948, SAMN23893960-SAMN23893963, SAMN23893965, and SAMN25157024-SAMN25157101. We aligned demultiplexed ddRAD reads to the N. lecontei reference genome (Nlec1.1 GenBank assembly accession number- GCA_001263575.2) using the very sensitive setting in bowtie2. We only retained reads that aligned to one locus in the reference genome and had a Phred score greater than 30. For the ddRAD dataset, we removed PCR duplicates using a custom script. We called SNPs in samtools. We required all sites to have a minimum of 7x coverage and 50% missing data or less. We also removed SNPs with significantly more heterozygotes than expected under Hardy-Weinberg equilibrium (an indicator of genotyping/mapping error). We removed any individual that was missing more than 70% of the data. We performed all filtering in VCFtools v0.1.13. We created several datasets with subsets of individuals and additional filtering for each of the population genetic analyses. We generated three data sets with minor allele filtering (MAF, SNPs <0.01 removed): 1) sympatric N. pinetum and N. lecontei for genome-wide patterns of divergence (36,935 SNPs), 2) sympatric N. pinetum, N. lecontei, and hybrids for admixture analysis (35,649 SNPs), and 3) sympatric N. pinetum, N. lecontei, allopatric N. lecontei, and outgroup N. virginiana for ABBA- BABA tests (12,905 SNPs). We also generated a down-sampled dataset (described below) without a MAF filter for estimating site-frequency spectra (SFS) that included sympatric N. pinetum, N. lecontei, and N. virginiana for demographic analyses.
创建时间:
2023-06-28
二维码
社区交流群
二维码
科研交流群
商业服务