five

RR3DD: an RNA global structure-based RNA three-dimensional structural classification database

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://figshare.com/articles/dataset/RR3DD_an_RNA_global_structure-based_RNA_three-dimensional_structural_classification_database/16828963
下载链接
链接失效反馈
官方服务:
资源简介:
The three-dimensional (3D) structure of RNA usually plays an important role in the recognition with RNA-binding protein. Along with the discovering of RNAs, several RNA databases are developed to study the functions of RNA based on sequence, secondary structure, local 3D structural motif and global structure. Based on RNA function and structure, different RNAs are classified and stored in SCOR and DARTS, respectively. The classification of RNA structures is useful in RNA structure prediction and function annotation. However, the SCOR and DARTS are not updated any more. In this study, we present an RNA classification database RR3DD based on RNA fold with the global 3D structural similarity. The RR3DD includes 13,601 RNA chains from PDB and mmCIF format structures which are classified into 780 RNA folds. The RNA chains from PDB and mmCIF format structures are aligned and clustered into 675 and 220 RNA folds, respectively. By analysing the RNA structure in RR3DD, we find that there are 11 clusters with more than 50 members. These clusters include rRNAs, riboswitches, tRNAs and so on. By mapping RR3DD into Rfam, we found that some RNAs without annotation by Rfam can be annotated through structural alignment. For example, we analysed tRNAs and found that tRNA were successfully grouped in RR3DD for which Rfam did not classify them into one family. Finally, we provide a web interface of RR3DD offering functions of browsing RR3DD, annotating RNA 3D structure and finding templates for RNA homology modelling.

核糖核酸(RNA)的三维(3D)结构通常在与RNA结合蛋白(RNA-binding protein)的识别过程中发挥关键作用。随着RNA研究的不断推进,多款RNA数据库已被开发,用于基于序列、二级结构、局部三维结构基序(structural motif)以及整体结构开展RNA功能研究。根据RNA的功能与结构特征,不同RNA分别被归类并存储于SCOR与DARTS数据库中。RNA结构分类对于RNA结构预测与功能注释具有重要价值,但SCOR与DARTS目前已停止更新。本研究基于全局三维结构相似性的RNA折叠(RNA fold)类型,构建了RNA分类数据库RR3DD。该数据库收录了来自PDB与mmCIF格式结构的13601条RNA链,并将其划分为780个RNA折叠类;其中,来自PDB格式结构的RNA链经比对聚类后得到675个RNA折叠类,来自mmCIF格式结构的RNA链则得到220个RNA折叠类。通过对RR3DD中的RNA结构进行分析,本研究发现共有11个成员数超过50的聚类簇,涵盖核糖体RNA(rRNAs)、核糖开关(riboswitches)、转运RNA(tRNAs)等类型。通过将RR3DD与Rfam数据库进行映射比对,本研究发现部分未被Rfam注释的RNA可通过结构比对实现功能注释。例如,本研究对tRNAs进行分析后发现,RR3DD成功将原本未被Rfam归为同一家族的tRNAs聚为一类。最后,本研究搭建了RR3DD的网页交互界面,提供数据库浏览、RNA三维结构注释以及RNA同源建模模板查找等功能。
创建时间:
2021-10-19
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作