five

Data for: PickMe: sample selection for species tree reconstruction using coalescent weighted quartets

收藏
Mendeley Data2024-05-17 更新2024-06-27 收录
下载链接:
https://zenodo.org/records/5168055
下载链接
链接失效反馈
官方服务:
资源简介:
After collecting large data sets of many genes for many species for phylogenomics studies, researchers may make ad hoc decisions about which genes or samples to include in a species tree reconstruction analysis based on various parameters, including the amount of missing data. Optimally, sampling would be maximized, but it can be difficult for empiricists to determine where to draw the line for sample inclusion when data sets are incomplete. Under the multispecies coalescent model, in which the dominant quartet topology displayed across gene trees matches the topology of that quartet on the species tree, we propose a Bayesian framework to select samples for which there is support for inclusion in a species tree analysis. Given a collection of gene trees, a posterior probability is assigned to each quartet topology, describing the likelihood that the species tree displays this topology. From this, individual samples are assigned reliability scores computed as the average of a rescaling of the posterior probabilities. These weights are used in a Bayesian framework in an algorithm called PickM}, which determines which individuals should be included in a species tree analysis. To illustrate the efficacy of this tool, PickMe is applied to gene trees generated from target capture data from milkweeds. PickMe indicates that more samples could have reliably been included in a previous milkweed phylogenomic analysis than the authors analyzed, without access to a formal decision-making procedure. Thus, PickMe will be a valuable addition to data analysis pipelines for phylogenomics studies.
创建时间:
2023-06-28
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作