five

Pinniped metatree data and scripts

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/8276114
下载链接
链接失效反馈
官方服务:
资源简介:
This is a copy of relevant data from the pinniped metatree project. Below explains the file structure of the folder. ## InputData Folder containing the following subfolders: ### Molecular_Data Folder containing two subdirectories (fulton and Lopes), each of which contains a further two subdirectories (mrp and xml) that contain the Matrix Representations with Parsimony (MRP) nexus files and XML files (see below), respectivly, for the two molecular studies used as backbone constraints ### MRP Matrix Representations with Parsimony (MRPs) obtained by re-analysing each morphological input data set under parsimony and retaining only unique biparitions until all such biparitions have been sampled. ### NEXUS The original character-taxon matrices for each morphological input data set. ### TNT TNT format input files used for reanalysis of each morphological input dataset. Each file contains the relevant commands to run the analyses. ### XML XML files that record important metadata about each input data set, particularly data set dependence and taxonomic reconciliation. ## metatree_files Folder containing the metatree data organised in the following subfolders: ### Consensus_trees Folder containing Strict (SCC) and Majority Rule (MRC) trees before (STR_***) and after (no prefix) safely reinserting taxa ### Files All files output by running the metatree function to assemble a reconciled complete MRP matrix over all input datasets: #### Character weights  weights for each character #### DataSetWeights.txt  information of the contribution of each dataset to the overall analysis  #### FULL.nex / FULL.tnt MRP files in nexus and tnt format containing all taxa (not used in main analysis) #### STR.nex / STR.tnt Safely reduced input files in nexus and tnt format. The tnt format file is the one used in main analyses. ### STR.tnt text file containing safely removed taxa and the rules used to reinsert them #### TaxonomyTreee.tre newick string containing the paleobiology database taxonomy for PanPinnipeds in tree form. Used in the metatree analysis as a very downweighted input. ### MPTS Folder containing all most parsimonious trees before and after Safe Taxonomic Reinsertion ### tnt MPTs from each of 1000 independent TNT searches (.tnt) and associated search info from screen output (.txt)  ### README.md This file. ### Scripts The set of R scripts used to generate the metatree #### BuildBeastXML.R A function used to assemble a nexus file, starting tree, topology constraints, and age priors for BEAST input. Is sourced by the file CreateBeastInput_Script.R #### Build_Metatree_Files_Combined.R Script to read in input files from input_files folder and assemble MRP matrices for generation of metatree #### CreateBeastInput_Script.R Script to read in a set of input files contained in the TimeTree folder and output a set of files that can be used to build a BEAST XML for Fossilized Birth Death tip dating analyses with a relaxed molecular clock #### makemrpandscandmpts.R Utility script to read the output of TNT for each source study, produce separate MPT and Strict Cosensus files, and generate an MRP file that can be used for input into the metatree function #### process_beast_output.R Script to read combined, post-burnin output from beast and find the median tree under the Kendall-Colijn distance metric. This tree is used in subsequent macroevolutionary analyses. #### runtntoutputcollator.R Utility script to read the output from TNT analysis of the ensemble MRP matrix, find all unique shortest trees, and assemble a combined MPT file, consensus trees, and perform safe taxonomic reinsertion.  ### TimeTree Files and Folders related to BEAST time-tree inference. Organized as follows: #### BEAST_Run Folder containing input xml file, log and tree files from the two independent MCMC analyses, combined post-burnin log and tree files, and the median tree from the combined posterior sample. #### partitionFinder PartitionFinder input and output files used to identify the appropriate partitioning scheme and evolutionary models for the augmented Fulton and Strobeck alignment.  #### strictclockrun xml and output files from a short analysis of extant pinnipeds only under a strict molecular clock. The posterior mean root height divided by the approximate age of crown pinnipeds was used to define a mean for the lognormal prior on the relaxed molecular clock in the main analysis #### MRC.tre The majority Rule Consensus metatree used to derive topological constraints for the BEAST analysis #### pinnipeds_extant_beast.nex augmented version of the Fulton and Strobeck alignment used in BEAST analysis #### Taxa_FAD.csv Stratigraphic uncertainty associated with the first appearances of fossil taxa. Used to define age ranges for fossils for the Fossilized Birth Death analysis.    ### TimeTreeInference Folder containing the R scripts and temporal data used to build the timetrees as well as the following subfolders: #### Dangerous Subfolder containing the results for the Dangerous timetree analyses. #### Risky Subfolder containing the results for the Risky timetree analyses. #### Safe Subfolder containing the results for the Safe timetree analyses.
创建时间:
2023-08-23
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作