five

Datasets for Lupo et al. (2024) Identification and Characterization of Archaeal Pseudomurein Biosynthesis Genes through Pangenomics

收藏
Figshare2022-11-30 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Datasets_for_Lupo_et_al_2022_Origin_and_Evolution_of_Pseudomurein_Biosynthetic_Gene_Clusters/21641612
下载链接
链接失效反馈
官方服务:
资源简介:
Lupo et al. 2022 Pseudomurein: Archive content for v2 (Nov 2024)Overview... 59 directories, 777 filesREADME.md: this file.command-line.sh: examples of bash commands to use or generate the files stored in this archive.10-archaeal-proteomesThis directory contains an archive (.tar.gz) consisting of FASTA (.faa) files of the 10 organisms used by OrthoFinder.archaeal-genomesThis directory contains an archive (.tar.gz) consisting of 819 FASTA (.faa) files corresponding to the archaeal database.bacterial-genomesThis directory contains an archive (.tar.gz) consisting of 598 FASTA (.faa) files corresponding to the bacterial database.configThis directory contains two configuration files (.yaml) used by the classify-ali.pl perl script from Bio::MUST modules:classifier.yaml is the configuration file used to filter the 6,321 orthologous groups (OGs).five_org_classifier.yaml is the configuration file used to filter the OGs after the round of Forty-Two TBLASTN.BLASTPList of configuration files to run Forty-Two BLASTP (FILTERING OF CANDIDATE PROTEINS; see figure S1 in the main manuscript).TBLASTNList of configuration files to run Forty-Two TBLASTN (FILTERING OF CANDIDATE PROTEINS; see figure S1 in the main manuscript).NCBI_CCDList of HMM (.hmm) profile files downloaded from NCBI CDD database and corresponding HMM search (.hmms) files.ompapa-resultsRaw Ompa-Pa results associated with NCBI CDD HMM profiles.ompapaThis directory contains a list of the OGs that have passed the taxonomic filter (see config sections).alignmentsList of alignments in FASTA format of the OGs included in the retained_OGs.lis file (see ompapa section).bacterial_dbResults of ompapa analyses against the bacterial database. This directory contains two sub-directories:hmms that contains a list of HMM search (.hmms) files.ompapa-results that contains raw ompapa results.hmm_profilesList of HMM (.hmm) profiles of the OGs included in the retained_OGs.lis file (see ompapa section).prokaryotic_dbResults of ompapa analyses against the prokaryotic database. This directory contains two sub-directories:hmms that contains a list of HMM search (.hmms) files.ompapa-results that contains raw ompapa results.orthologous-groupsThis directory contains an archive (.tar.gz) consisting of 6,321 FASTA (.fasta) files corresponding to the OGs generated by OrthoFinder.predictionsThis directory contains three sub-directories:interproscan that contains raw InterProScan results.signalp5 that contains raw SignalP5 results.tmhmm that contains raw TMHMM results.For the consolidated results, see Table S1 in the main manuscript.prokaryotic-genomesThis directory contains a list of assembly accessions of the 80,490 organisms from the prokaryotic database.regulonThis directory contains all the data needed to reproduce the 'regulon pipeline' (see supplementary data from the main manuscript). All the command lines are included in the command_line_regulon.sh file.scriptsThis directory contains various perl scripts used to generate some of the files stored in this archive (see command-line.sh file).taxdumpMirror of the NCBI Taxonomy used in this study (downloaded on 4th of May 2020).treesThis directory contains three sub-directories:atp-graspmray-familymur-familyAll the sub-directories contains tree (.tre) files presented in the main manuscript and corresponding raw IQ-TREE (.ckp.gz) results. They also contain various files needed to reproduce the phylogenetic analyses (see command-line.sh file).
创建时间:
2022-11-30
二维码
社区交流群
二维码
科研交流群
商业服务