Datasets & Supplemental Information for "Micromonosporaceae Biosynthetic Gene Cluster Diversity Highlights the Need for Broad Spectrum Investigation".
DataCite Commons | Updated: 2023-08-09 | Indexed: 2024-08-18
Download link:
https://figshare.com/articles/dataset/Datasets_Supplemental_Information_for_MIcromonosporaceae_Biosynthetic_Gene_Cluster_Diversity_Highlights_the_Need_for_Broad_Spectrum_Investigation_/23907624
Description:
This data collection contains:<br>
<b>Data S1</b>: A folder with the FASTA files for all 42 strains (41 <i>Micromonosporaceae</i>, 1 <i>Streptomycetaceae</i>).<br>
<b>Data S2</b>: A folder with all the .gbk files for the BGC regions predicted by antiSMASH v5.1.1. These files were used as inputs for BiG-SCAPE and BiG-SLiCE.<br>
<b>Data S3</b>: A folder with all the .gbk files for the BGC regions predicted by antiSMASH v6.1.0.<br>
<b>Data S4</b>: A folder containing the QUAST outputs for all 42 strains.<br>
<b>Data S5</b>: A folder containing the BUSCO outputs for all 42 strains. Example scripts are provided for scraping relevant information from the individual BUSCO outputs.<br>
<b>Data S6</b>: A folder containing GTDB (Genome Taxonomy Database) classification results and species-level grouping results from FastANI (95% cutoff).<br>
<b>Data S7</b>: A folder containing an Interactive Tree of Life (iTOL)-compatible bar chart annotation built from the antiSMASH v5.1.1 BGC region information.<br>
<b>Data S8</b>: A folder containing a Word document that describes the command-line parameters used under Ubuntu WSL (Windows Subsystem for Linux) for antiSMASH v6.1.2, BiG-SCAPE v1.1.2, and BiG-SLiCE v1.1.1. Parameters for MDSC in Python are also included, along with an example script for batch queries of BGCs against BiG-SLiCE v1.1.1's pre-processed dataset of ~1.2 million BGCs.<br>
<b>Data S9</b>: A folder containing the BiG-SCAPE visualization, in Cytoscape, of the 38 <i>Micromonosporaceae</i> strains that remained after QC filtering (WMMA1363, WMMB482, WMMB486, and WMMC500 were excluded).<br>
<b>Data S10</b>: A folder containing:<br>
The pre-processed dataset of 1.2 million BGCs from BiG-SLiCE.<br>
All report folders generated by BiG-SLiCE for the 779 <i>Micromonosporaceae</i> BGCs queried against the 1.2 million BGCs.<br>
The results data.db and associated folders for the pre-processed dataset of 1.2 million BGCs.<br>
<b>Data S11</b>: A folder containing the scripts needed to regenerate the figures and perform independent analyses, together with the data used for those analyses.<br>
<b>Data S12</b>: A folder containing the results of the nucleotide BLAST of WMMA1947.region12's siderophore contig against WMMD1120.region14's siderophore contig.
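As an illustration of the species-level grouping described under Data S6, the following is a minimal sketch of how pairwise FastANI results could be grouped at a 95% cutoff. It is not the authors' script. It assumes FastANI's standard tab-separated output (query, reference, ANI, mapped fragments, total fragments) and assumes single-linkage grouping, which the description does not specify. The function names and the genome file names in the usage note are hypothetical.

```python
from collections import defaultdict


def parse_fastani(lines):
    """Yield (query, reference, ani) from FastANI tab-separated output lines."""
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        yield fields[0], fields[1], float(fields[2])


def group_by_ani(rows, cutoff=95.0):
    """Group genomes by single linkage: any pair with ANI >= cutoff is merged.

    Single linkage is an assumption made for this sketch; the 95% default
    mirrors the cutoff stated in the dataset description.
    """
    parent = {}

    def find(x):
        # Union-find lookup with path halving.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for query, ref, ani in rows:
        root_q, root_r = find(query), find(ref)  # registers both genomes
        if ani >= cutoff and root_q != root_r:
            parent[root_r] = root_q

    groups = defaultdict(list)
    for genome in list(parent):
        groups[find(genome)].append(genome)
    # Sort members and groups so the result is deterministic.
    return sorted(sorted(members) for members in groups.values())
```

With hypothetical input such as `group_by_ani(parse_fastani(open("fastani_all_vs_all.tsv")))`, each returned list holds the genomes that fall into one putative species-level group. Genomes with no partner at or above the cutoff come back as single-member groups.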
Provider:
figshare
Created:
2023-08-08



