Data for: Weighting by gene tree uncertainty improves accuracy of quartet-based species trees
收藏DataCite Commons2025-06-01 更新2025-04-10 收录
下载链接:
https://datadryad.org/dataset/doi:10.6076/D1WK5R
下载链接
链接失效反馈官方服务:
资源简介:
Phylogenomic analyses routinely estimate species trees using methods that
account for gene tree discordance. However, the most scalable species tree
inference methods, which summarize independently inferred gene trees to
obtain a species tree, are sensitive to hard-to-avoid errors introduced in
the gene tree estimation step. This dilemma has created much debate on the
merits of concatenation versus summary methods and practical obstacles to
using summary methods more widely and to the exclusion of concatenation.
The most successful attempt at making summary methods resilient to noisy
gene trees has been contracting low support branches from the gene trees.
Unfortunately, this approach requires arbitrary thresholds and poses new
challenges. Here, we introduce threshold-free weighting schemes for the
quartet-based species tree inference, the metric used in the popular
method ASTRAL. By reducing the impact of quartets with low support or long
terminal branches (or both), weighting provides stronger theoretical
guarantees and better empirical performance than the unweighted ASTRAL.
Our simulations show that weighting improves accuracy across many
conditions and reduces the gap with concatenation in conditions with low
gene tree discordance and high noise. On empirical data, weighting
improves congruence with concatenation and increases support. Together,
our results show that weighting, enabled by a new optimization algorithm
we introduce, improves the utility of summary methods and can reduce the
incongruence often observed across analytical pipelines.
提供机构:
Dryad
创建时间:
2023-06-23



