Some limitations of public sequence data for phylogenetic inference (in plants)
收藏DataONE2020-06-24 更新2025-05-10 收录
下载链接:
https://search.dataone.org/view/sha256:45d1d6e40ac46cfae609484449c1b34dae2649025e55ed2ef4e783e65b0da7a2
下载链接
链接失效反馈官方服务:
资源简介:
The GenBank database contains essentially all of the nucleotide sequence data generated for published molecular systematic studies, but for the majority of taxa these data remain sparse. GenBank has value for phylogenetic methods that leverage dataâmining and rapidly improving computational methods, but the limits imposed by the sparse structure of the data are not well understood. Here we present a tree representing 13,093 land plant generaâan estimated 80% of extant plant diversityâto illustrate the potential of public sequence data for broad phylogenetic inference in plants, and we explore the limits to inference imposed by the structure of these data using theoretical foundations from phylogenetic data decisiveness. We find that despite very high levels of missing data (over 96%), the present data retain the potential to inform over 86.3% of all possible phylogenetic relationships. Most of these relationships, however, are informed by small amounts of dataâapproximately half are inf...
创建时间:
2025-04-18



