Cypovirus_Alignment.fasta
收藏Recherche Data Gouv France2023-01-01 更新2026-04-09 收录
下载链接:
https://entrepot.recherche.data.gouv.fr/file.xhtml?persistentId=doi:10.57745/SAHKVL
下载链接
链接失效反馈官方服务:
资源简介:
Alignment of sequences available for the genus cypovirus corresponding to aminoacid sequences of the RNA-dependent RNA polymerase (RdRp). The alignement included representative members of the genus recognized by the ICTV (6 of the 16 species), putative Cypovirus 19 isolated from Operophtera brumata, 14 sequences identified in GenBank as corresponding to undescribed Cypovirus and the sequence we identified from the pine processionary moth (TR92789, Genbank accession number MW584281). Multiple protein sequence alignments were generated with the MAFFT v.7 alignment program with default parameters, using a G-INS-i iterative refinement method. Gblocks method implemented in SEAVIEW v5.0.4 was used for the refinement of alignment in order to eliminate poorly aligned positions and divergent regions of aligned sequences, resulting in 690 amino acids. The optimal substitution models was identified using the SMS program as the LG +G+I+F model.
本数据集为胞质型多角体病毒属(Cypovirus)相关序列的比对结果,其对应靶标为该属病毒的RNA依赖的RNA聚合酶(RNA-dependent RNA polymerase, RdRp)氨基酸序列。本次比对涵盖该属经国际病毒分类委员会(ICTV)认定的代表成员(16个物种中的6个)、从冬尺蛾(Operophtera brumata)中分离得到的推定胞质型多角体病毒19号株、14条在GenBank数据库中注释为未命名胞质型多角体病毒的序列,以及本研究从松异舟蛾中鉴定得到的序列(TR92789,GenBank登录号MW584281)。本研究采用MAFFT v.7多序列比对软件,以默认参数结合G-INS-i迭代优化算法,构建了多重蛋白质序列比对。随后借助SEAVIEW v5.0.4中集成的Gblocks算法对初始比对结果进行优化,以去除比对质量不佳的位点与序列高变区域,最终得到长度为690个氨基酸的有效比对序列。本研究通过SMS程序筛选得到最优进化替换模型为LG +G+I+F模型。
创建时间:
2023-01-01



