five

Benefits and limits of phasing alleles for network inference of allopolyploid complexes

收藏
DataONE2024-05-08 更新2024-06-08 收录
下载链接:
https://search.dataone.org/view/sha256:c5697c3fc8c8594944e1cfc2b82205c2957cc5690afbea5c318c23ac84894a36
下载链接
链接失效反馈
官方服务:
资源简介:
Accurately reconstructing the reticulate histories of polyploids remains a central challenge for understanding plant evolution. Although phylogenetic networks can provide insights into relationships among polyploid lineages, inferring networks may be hindered by the complexities of homology determination in polyploid taxa. We use simulations to show that phasing alleles from allopolyploid individuals can improve phylogenetic network inference under the multispecies coalescent by obtaining the true network with fewer loci compared to haplotype consensus sequences or sequences with heterozygous bases represented as ambiguity codes. Phased allelic data can also improve divergence time estimates for networks, which is helpful for evaluating allopolyploid speciation hypotheses and proposing mechanisms of speciation. To achieve these outcomes in empirical data, we present a novel pipeline that leverages a recently developed phasing algorithm to reliably phase alleles from polyploids. This pip..., , Included is a single pdf of the supplementary material as well as a tarball with all of the data used in the manuscript. Download the tarball and within is a readme detailing contents including: 1) Control files and simulated sequence data for the single allotetraploid with BPP 2) A static release of the PATÉ pipeline used for phasing Dryopteris sequences in the manuscript 3) Dryopteris data used for analyses 4) BPP Control files, Julia scripts, and notes for repeating some of the empirical analyses, # Benefits and Limits of Phasing Alleles for Network Inference of Allopolyploid Complexes [https://doi.org/10.5061/dryad.5qfttdz53](https://doi.org/10.5061/dryad.5qfttdz53) Contents include the simulated sequence data as well as the empirical *Dryopteris* data and some of the files used for analyses. Most of the code used throughout the manuscript is cited and available on github. However, we include a static version of [PATÉ](https://github.com/gtiley/Phasing) used at the time of writing and the control file options used for our analyses. ## Description of the data and file structure ### simulation t1 - simulations for tau_h = 0.001 and theta = tau_h t2 - simulations for tau_h = 0.001 and theta = 2 * tau_h t3 - simulations for tau_h = 0.001 and theta = 3 * tau_h t4 - simulations for tau_h = 0.001 and theta = 4 * tau_h t5 - simulations for tau_h = 0.01 and theta = tau_h t6 - simulations for tau_h = 0.01 and theta = 2 * tau_h t7 - simulations for tau_h = 0.01 and theta = 3 * tau_h t...
创建时间:
2025-07-31
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作