Data for article: Generation of accurate, expandable phylogenomic trees with uDANCE
收藏DataONE2023-06-20 更新2024-06-08 收录
下载链接:
https://search.dataone.org/view/sha256:97a0b012b66ba568173dd5f7af68cdcd8400744476310974a933834b89bbaf58
下载链接
链接失效反馈官方服务:
资源简介:
Phylogenetic trees provide a framework for organizing evolutionary histories across the tree of life and aid downstream comparative analyses such as metagenomic identification. Methods that rely on single marker genes such as 16S rRNA have produced trees of limited accuracy with hundreds of thousands of organisms, whereas methods that use genome-wide data are not scalable to large numbers of genomes. We introduce uDance, a method that enables updatable genome-wide inference using a divide-and-conquer strategy that refines different parts of the tree independently and can build off of existing trees, with high accuracy and scalability. With uDance, we infer a species tree of roughly 200,000 genomes using 387 marker genes, totaling 42.5 billion amino acid residues.
Simulated datasets: HD-100, HD-500, MD-100, MD-500, LD-100, HD-P1, HD-P2, HD-P3, HD-P4, HD-P5, HD-HET, Serial, and Varying Backbone size/Partition size/Backbone Tree. Biological datasets: Input, intermediate, and output files for uDance runs that generated the 16K and 200K trees of life.
创建时间:
2023-11-08



