Data from: Graph splitting: a graph-based approach for superfamily-scale phylogenetic tree reconstruction
Source: DataCite Commons. Updated: 2025-06-01. Indexed: 2025-06-15.
Download link:
https://datadryad.org/dataset/doi:10.5061/dryad.ps0qf4r
Description:
A protein superfamily contains distantly related proteins that have
acquired diverse biological functions through a long evolutionary history.
Phylogenetic analysis of the early evolution of protein superfamilies is a
key challenge because existing phylogenetic methods show poor performance
when protein sequences are too diverged to construct an informative
multiple sequence alignment. Here, we propose the Graph Splitting (GS)
method, which rapidly reconstructs a protein superfamily-scale
phylogenetic tree using a graph-based approach. Evolutionary simulation
showed that the GS method can accurately reconstruct phylogenetic trees
and is robust to major problems in phylogenetic estimation, such as biased
taxon sampling, heterogeneous evolutionary rates, and long-branch
attraction when sequences are substantially diverged. Its application to
an empirical dataset of the triosephosphate isomerase (TIM)-barrel
superfamily suggests rapid evolution of protein-mediated pyrimidine
biosynthesis, likely taking place after the RNA world. Furthermore, the GS
method can also substantially improve performance of widely used multiple
sequence alignment methods by providing accurate guide trees.
Provider:
Dryad
Created:
2019-07-22



