five

Data files and code on the comparison of SARS-CoV-2 with non-segmented RNA viruses

收藏
DataCite Commons2020-10-02 更新2024-07-28 收录
下载链接:
https://springernature.figshare.com/articles/dataset/Data_files_and_code_on_the_comparison_of_SARS-CoV-2_with_non-segmented_RNA_viruses/12482813/1
下载链接
链接失效反馈
官方服务:
资源简介:
This fileset contains 15 data files and 1 ReadMe file.The data files are as follows:<br>Five Results files in <b>.fasta</b> file format. These are: Result_MacroDomain_.fasta, Result_Spike1_.fasta, Result_Spike2_.fasta, Result_Spike2protein_.fasta and Result_Viroporin_.fasta.<br>Two power point presentations (.<b>pptx</b> file format). These are: Analysis by MegaX_.pptx and Open Reading Frames_Conserved Domain Found in ORF and CDD_Children_.pptx.<br>Three data files in .<b>nwk</b> file format. These are: NeurotropicRNA.noSegmentR1_original tree.nwk, MacroDomainGen_BootstrapTree.nwk and ViroporinGen_BootstrapTree.nwk.<br>One code file in .<b>R</b> file format. This is: Protein Alignment RStudio_msa package_.R.<br>One file in .<b>tex</b> file format. This is: Covid19_.tex.<br>Two files in .<b>txt </b>file format. These are: Covid19_.txt and texshade.sty package_.txt.<br>One file in .<b>sty </b>file format. This is: texshade_.sty.<br>The 5 fasta files contain the results of the multiple protein sequence alignment.The power point presentations Open Reading Frames_Conserved Domain Found in ORF and CDD_Children.pptx contains the search results (snapshot figures) obtained by the Open Reading Frame (ORF) finder and Conserved Domains Database (CDD) database (NCBI). This file provides evidence to show how Figure2 and 3 were made.The power point presentation Analysis by MegaX_.pptx contains the evidence (parameters) to show how the sequences were aligned and how the tree files were made in Figure 1 by MegaX software.The three .nwk files (in Newick tree format) were produced using the MEGAX software. These files contain the data used to construct the phylogenetic trees shown in figures 1, 2B and 2C of the article.The R file contains all the codes required to produce figures 2 and 3 in the article.The Covid19.tex file works together with R Studio, containing the msa package (an R package for Multiple Sequence Alignment) to make Figures 2 and 3 in the article. The sty file is a system file for LaTex and contains codes. <br><br><b>Study aims and methodology</b>: The primary objective of the current study was to determine the possible evolutionary and molecular relationships between SARS-CoV-2 and non-segmented RNA viruses, especially the viruses that can infect the nervous system in infants and children.The whole-genome sequences of 35 non-segmented RNA viruses including 13 CoVs were retrieved from the National Center for Biotechnology Information (NCBI), for the purpose of phylogenetic analysis, which was conducted with MEGAX (Penn State University, PA, USA). All genomic sequences were aligned with the ClustalW algorithm and phylogenetic prediction inferred by the maximum likelihood method and Tamura-Nei model. RStudio (RStudio, Inc., Boston, MA, USA) with msa package was used for multiple protein sequence alignment. For more details on the methodology, please read the related article.<br>
提供机构:
figshare
创建时间:
2020-10-02
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作