five

Data from: Statistically consistent divide-and-conquer pipelines for phylogeny estimation using NJMerge

收藏
DataCite Commons2024-02-05 更新2024-07-13 收录
下载链接:
https://databank.illinois.edu/datasets/IDB-0569467
下载链接
链接失效反馈
官方服务:
资源简介:
This repository includes scripts and datasets for the paper, "Statistically consistent divide-and-conquer pipelines for phylogeny estimation using NJMerge." All data files in this repository are for analyses using the logdet distance matrix computed on the concatenated alignment. Data files for analyses using the average gene-tree internode distance matrix can be downloaded from the Illinois Data Bank (https://doi.org/10.13012/B2IDB-1424746_V1). The latest version of NJMerge can be downloaded from Github (https://github.com/ekmolloy/njmerge). ***When downloading datasets, please note that the following errors.*** In tools.zip, the compare_trees.py and the compare_tree_lists.py scripts incorrectly refer to the normalized symmetric difference as the normalized Robinson-Foulds distance. Because the normalized symmetric difference and the normalized Robinson-Foulds distance are equal for binary trees, this does not impact the species tree error rates reported in the study. This can impact the gene tree error rates reported in the study (see data-gene-trees.csv in data.zip), as FastTree-2 returns trees with polytomies whenever 3 or more sequences in the input alignment are identical. Note that the symmetric difference rate is always greater than or equal to the Robinson-Foulds error rate, so the gene tree error rates reported in the study are more conservative.
提供机构:
University of Illinois at Urbana-Champaign
创建时间:
2018-11-12
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作