Data for: PickMe: sample selection for species tree reconstruction using coalescent weighted quartets
收藏Mendeley Data2024-05-17 更新2024-06-27 收录
下载链接:
https://zenodo.org/records/5168055
下载链接
链接失效反馈官方服务:
资源简介:
After collecting large data sets of many genes for many species for phylogenomics studies, researchers may make ad hoc decisions about which genes or samples to include in a species tree reconstruction analysis based on various parameters, including the amount of missing data. Optimally, sampling would be maximized, but it can be difficult for empiricists to determine where to draw the line for sample inclusion when data sets are incomplete. Under the multispecies coalescent model, in which the dominant quartet topology displayed across gene trees matches the topology of that quartet on the species tree, we propose a Bayesian framework to select samples for which there is support for inclusion in a species tree analysis. Given a collection of gene trees, a posterior probability is assigned to each quartet topology, describing the likelihood that the species tree displays this topology. From this, individual samples are assigned reliability scores computed as the average of a rescaling of the posterior probabilities. These weights are used in a Bayesian framework in an algorithm called PickM}, which determines which individuals should be included in a species tree analysis. To illustrate the efficacy of this tool, PickMe is applied to gene trees generated from target capture data from milkweeds. PickMe indicates that more samples could have reliably been included in a previous milkweed phylogenomic analysis than the authors analyzed, without access to a formal decision-making procedure. Thus, PickMe will be a valuable addition to data analysis pipelines for phylogenomics studies.
创建时间:
2023-06-28



