Pinniped metatree data and scripts
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/8276114
下载链接
链接失效反馈官方服务:
资源简介:
This is a copy of relevant data from the pinniped metatree project. Below explains the file structure of the folder.
## InputData
Folder containing the following subfolders:
### Molecular_Data
Folder containing two subdirectories (fulton and Lopes), each of which contains a further two subdirectories (mrp and xml) that contain the Matrix Representations with Parsimony (MRP) nexus files and XML files (see below), respectivly, for the two molecular studies used as backbone constraints
### MRP
Matrix Representations with Parsimony (MRPs) obtained by re-analysing each morphological input data set under parsimony and retaining only unique biparitions until all such biparitions have been sampled.
### NEXUS
The original character-taxon matrices for each morphological input data set.
### TNT
TNT format input files used for reanalysis of each morphological input dataset. Each file contains the relevant commands to run the analyses.
### XML
XML files that record important metadata about each input data set, particularly data set dependence and taxonomic reconciliation.
## metatree_files
Folder containing the metatree data organised in the following subfolders:
### Consensus_trees
Folder containing Strict (SCC) and Majority Rule (MRC) trees before (STR_***) and after (no prefix) safely reinserting taxa
### Files
All files output by running the metatree function to assemble a reconciled complete MRP matrix over all input datasets:
#### Character weights
weights for each character
#### DataSetWeights.txt
information of the contribution of each dataset to the overall analysis
#### FULL.nex / FULL.tnt
MRP files in nexus and tnt format containing all taxa (not used in main analysis)
#### STR.nex / STR.tnt
Safely reduced input files in nexus and tnt format. The tnt format file is the one used in main analyses.
### STR.tnt
text file containing safely removed taxa and the rules used to reinsert them
#### TaxonomyTreee.tre
newick string containing the paleobiology database taxonomy for PanPinnipeds in tree form. Used in the metatree analysis as a very downweighted input.
### MPTS
Folder containing all most parsimonious trees before and after Safe Taxonomic Reinsertion
### tnt
MPTs from each of 1000 independent TNT searches (.tnt) and associated search info from screen output (.txt)
### README.md
This file.
### Scripts
The set of R scripts used to generate the metatree
#### BuildBeastXML.R
A function used to assemble a nexus file, starting tree, topology constraints, and age priors for BEAST input. Is sourced by the file CreateBeastInput_Script.R
#### Build_Metatree_Files_Combined.R
Script to read in input files from input_files folder and assemble MRP matrices for generation of metatree
#### CreateBeastInput_Script.R
Script to read in a set of input files contained in the TimeTree folder and output a set of files that can be used to build a BEAST XML for Fossilized Birth Death tip dating analyses with a relaxed molecular clock
#### makemrpandscandmpts.R
Utility script to read the output of TNT for each source study, produce separate MPT and Strict Cosensus files, and generate an MRP file that can be used for input into the metatree function
#### process_beast_output.R
Script to read combined, post-burnin output from beast and find the median tree under the Kendall-Colijn distance metric. This tree is used in subsequent macroevolutionary analyses.
#### runtntoutputcollator.R
Utility script to read the output from TNT analysis of the ensemble MRP matrix, find all unique shortest trees, and assemble a combined MPT file, consensus trees, and perform safe taxonomic reinsertion.
### TimeTree
Files and Folders related to BEAST time-tree inference. Organized as follows:
#### BEAST_Run
Folder containing input xml file, log and tree files from the two independent MCMC analyses, combined post-burnin log and tree files, and the median tree from the combined posterior sample.
#### partitionFinder
PartitionFinder input and output files used to identify the appropriate partitioning scheme and evolutionary models for the augmented Fulton and Strobeck alignment.
#### strictclockrun
xml and output files from a short analysis of extant pinnipeds only under a strict molecular clock. The posterior mean root height divided by the approximate age of crown pinnipeds was used to define a mean for the lognormal prior on the relaxed molecular clock in the main analysis
#### MRC.tre
The majority Rule Consensus metatree used to derive topological constraints for the BEAST analysis
#### pinnipeds_extant_beast.nex
augmented version of the Fulton and Strobeck alignment used in BEAST analysis
#### Taxa_FAD.csv
Stratigraphic uncertainty associated with the first appearances of fossil taxa. Used to define age ranges for fossils for the Fossilized Birth Death analysis.
### TimeTreeInference
Folder containing the R scripts and temporal data used to build the timetrees as well as the following subfolders:
#### Dangerous
Subfolder containing the results for the Dangerous timetree analyses.
#### Risky
Subfolder containing the results for the Risky timetree analyses.
#### Safe
Subfolder containing the results for the Safe timetree analyses.
创建时间:
2023-08-23



