Matrix describing individual features used in phylogenetic analysis of vRdPs.

NIAID Data Ecosystem2026-03-08 收录

下载链接：

https://figshare.com/articles/dataset/_Matrix_describing_individual_features_used_in_phylogenetic_analysis_of_vRdPs_/1022213

下载链接

链接失效反馈

官方服务：

资源简介：

Individual vRdP structures are introduced by PBD code-strain and they are assigned to a virus species. Rows in the matrix represent vRdPs, while the compared features are listed as 21 columns. Compared features are: (A) polymerase product - 0 RNA, 1 DNA; (B) polymerase template - 0 RNA, 1 both DNA and RNA; (C) NA synthesis initiation - 0 de novo, 1 protein primer, 2 RNA primer; (D) overall polymerase domain architecture as described in [23] - 0 active site is encircled by finger tips, 1 active site is open (fingers subdomain do not touch thumb subdomain); (E) polymerase core organization - 0 ABC, 1 CAB; (F) motif F length - 0 normal (motif is F2 is present), 1 short (motif F2 is absent), 2 long (insertion is present in motif F); (G) motif F structure - 0 ββα(310)β, 1 βββ, 2 ββ; (H) F - A (C) motif connection - 0 short (≤35 amino acid residues), 1 long structured (>35 amino acid residues); (I) motif A structure - 0 -310, 1 βα, 2 β310; (J) A–B motif connection - 0 ααββ, 1 αββαββ, 2 ββ; (K) length of helix in motif B - 0 normal (≤21 amino acid residues), 1 long (>22 amino acid residues); (L) kink in motif B - 0 absent, 1 present; (M) B - C (D) motifs connection - 0 very short (≤5 amino acid residues), 1 loop (6–14 amino acid residues), 2 long helical (≥15 amino acid residues, at least 8 amino acid residues long helix); (N) motif C length - 0 short (10 amino acid residues), 1 long (>10 amino acid residues); (O) C (B)–D motifs connection - 0 short loop (≤5 amino acid residues), 1 long loop (>5 amino acid residues); (P) motif D structure - 310α-, 1 α-, 2αβ; (Q) position of helix in motif D - 0 normal position, 1 shifted position; (R) D–E motif connection - 0 short (<20 amino acid residues), 1 long structured (<20 amino acid residues); (S) motif E structure - 0 wide, 1 narrow; (T) thumb domain size - 0 large (>180 amino acid residues), 1 small (<180 amino acid residues); (U) priming motif - 0 none, 1 priming loop in thumb subdomain, 2 priming loop in palm subdomain, 3 polymerase C terminal part. Symbols α, β, 310, and L mean α helix, β strand, 310 helix, and loop, respectively.

单个病毒RNA依赖RNA聚合酶（viral RNA-dependent RNA polymerase, vRdP）结构以PBD编号-菌株的形式进行标注，并被归类至相应病毒物种。该矩阵的行代表各个vRdP，待比较的特征以21个列项进行罗列。待比较的特征如下：(A) 聚合酶产物：0 对应RNA，1 对应DNA；(B) 聚合酶模板：0 对应RNA，1 可同时适配DNA与RNA；(C) 核酸合成起始方式：0 为从头合成，1 以蛋白作为引物，2 以RNA作为引物；(D) 整体聚合酶结构域架构（引自文献[23]）：0 为活性位点被指状结构域末端环绕，1 为活性位点呈开放状态（指状亚结构域未与拇指亚结构域接触）；(E) 聚合酶核心组织形式：0 为ABC型，1 为CAB型；(F) 基序F长度：0 为正常长度（存在基序F2），1 为短长度（缺失基序F2），2 为长长度（基序F中存在插入序列）；(G) 基序F二级结构：0 为ββα(3₁₀)β型，1 为βββ型，2 为ββ型；(H) F-A（C）基序连接区域：0 为短连接（≤35个氨基酸残基），1 为长且具有结构化特征的连接（>35个氨基酸残基）；(I) 基序A二级结构：0 为-3₁₀型，1 为βα型，2 为β3₁₀型；(J) A-B基序连接区域：0 为ααββ型，1 为αββαββ型，2 为ββ型；(K) 基序B内螺旋长度：0 为正常长度（≤21个氨基酸残基），1 为长长度（>22个氨基酸残基）；(L) 基序B扭结情况：0 无扭结，1 存在扭结；(M) B-C（D）基序连接区域：0 为极短连接（≤5个氨基酸残基），1 为环结构连接（6-14个氨基酸残基），2 为长螺旋结构连接（≥15个氨基酸残基，其中螺旋长度至少为8个氨基酸残基）；(N) 基序C长度：0 为短长度（10个氨基酸残基），1 为长长度（>10个氨基酸残基）；(O) C（B）-D基序连接区域：0 为短环结构（≤5个氨基酸残基），1 为长环结构（>5个氨基酸残基）；(P) 基序D二级结构：0 为3₁₀α-型，1 为α-型，2 为αβ型；(Q) 基序D内螺旋的位置：0 为正常位置，1 为偏移位置；(R) D-E基序连接区域：0 为短连接（<20个氨基酸残基），1 为长且结构化的连接（<20个氨基酸残基）；(S) 基序E二级结构：0 为宽型，1 为窄型；(T) 拇指结构域大小：0 为大尺寸（>180个氨基酸残基），1 为小尺寸（<180个氨基酸残基）；(U) 引发基序：0 无引发基序，1 为拇指亚结构域中的引发环，2 为掌状亚结构域中的引发环，3 为聚合酶C端区域。文中符号α、β、3₁₀及L分别代表α螺旋、β折叠链、3₁₀螺旋与环结构。

创建时间：

2014-05-09

5,000+

优质数据集

54 个

任务类型

进入经典数据集