five

Datasets and additional data used in the paper: Michela Quadrini, Emanuela Merelli, and Luca Tesei (2025). “Abstractions for RNA Secondary Structure Analysis.”

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://figshare.com/articles/dataset/Datasets_and_additional_data_used_in_the_paper_Michela_Quadrini_Emanuela_Merelli_and_Luca_Tesei_2025_Abstractions_for_RNA_Secondary_Structure_Analysis_/30418705
下载链接
链接失效反馈
官方服务:
资源简介:
Datasets and Additional DataThis repository contains the datasets and additional data used in the paper: Michela Quadrini, Emanuela Merelli, and Luca Tesei (2025). “Abstractions for RNA Secondary Structure Analysis.” Folder: 5S_Ribosomal_RNAThis folder contains the data used for the experiment on 264 5S ribosomal RNA (5S rRNA) molecules from the CRW2 database. Subfolder Molecules_DBN/ Contains the secondary structures of the 264 RNAs in dot-bracket notation.Subfolder Molecules_Abstractions_DBN/Shape/ Contains the computed Shape abstractions and a text file with the counts of different shapes.Subfolder Molecules_Abstractions_DBN/Core/ Contains the computed Core abstractions and a text file with the counts of different cores.Subfolder Molecules_Abstractions_DBN/Core_Plus/ Contains the computed Core Plus abstractions and a text file with the counts of different core-plus patterns.Folder: Pseudoknot_MotifsThis folder contains the data used for the experiment on 332 RNA pseudoknots from the Pseudobase++ database. Molecules in dot-bracket notationSubfolder Molecules_DBN/Pseudoknot_H/ Contains the secondary structures of the 253 RNAs with H-type pseudoknots.Subfolder Molecules_DBN/Pseudoknot_HLout/ Contains the secondary structures of the 64 RNAs with HL_out-type pseudoknots.Subfolder Molecules_DBN/Pseudoknot_HLin/ Contains the secondary structures of the 4 RNAs with HL_in-type pseudoknots.Subfolder Molecules_DBN/Pseudoknot_LL/ Contains the secondary structures of the 11 RNAs with LL-type pseudoknots.Computed abstractionsSubfolder Molecules_Abstractions_DBN/Shape/ Contains the computed Shape abstractions and a text file with the counts of different shapes for each pseudoknot type (H, HL_out, HL_in, LL).Subfolder Molecules_Abstractions_DBN/Core/ Contains the computed Core abstractions and a text file with the counts of different cores for each pseudoknot type (H, HL_out, HL_in, LL).Subfolder Molecules_Abstractions_DBN/Core_Plus/ Contains the computed Core Plus abstractions and a text file with the counts of different core-plus patterns for each pseudoknot type (H, HL_out, HL_in, LL).

数据集与附加数据 本仓库包含2025年发表的论文《Abstractions for RNA Secondary Structure Analysis》(作者:Michela Quadrini、Emanuela Merelli及Luca Tesei)所使用的数据集与附加数据。 ### 5S核糖体RNA(5S Ribosomal RNA)文件夹 本文件夹包含针对CRW2数据库中264条5S核糖体RNA(5S Ribosomal RNA, 5S rRNA)分子开展实验所用的数据。 - 子文件夹`Molecules_DBN/`:包含264条RNA的二级结构,以点括号表示法(dot-bracket notation, DBN)存储。 - 子文件夹`Molecules_Abstractions_DBN/Shape/`:包含计算得到的形状抽象(Shape abstraction),以及一份记录不同形状计数的文本文件。 - 子文件夹`Molecules_Abstractions_DBN/Core/`:包含计算得到的核心抽象(Core abstraction),以及一份记录不同核心模式计数的文本文件。 - 子文件夹`Molecules_Abstractions_DBN/Core_Plus/`:包含计算得到的核心加抽象(Core Plus abstraction),以及一份记录不同核心加模式计数的文本文件。 ### 假结基序(Pseudoknot Motifs)文件夹 本文件夹包含针对Pseudobase++数据库中332条RNA假结开展实验所用的数据。 #### 点括号表示法存储的分子数据 - 子文件夹`Molecules_DBN/Pseudoknot_H/`:包含253条携带H型假结的RNA的二级结构。 - 子文件夹`Molecules_DBN/Pseudoknot_HLout/`:包含64条携带HL_out型假结的RNA的二级结构。 - 子文件夹`Molecules_DBN/Pseudoknot_HLin/`:包含4条携带HL_in型假结的RNA的二级结构。 - 子文件夹`Molecules_DBN/Pseudoknot_LL/`:包含11条携带LL型假结的RNA的二级结构。 #### 计算得到的抽象数据 - 子文件夹`Molecules_Abstractions_DBN/Shape/`:包含针对每种假结类型(H、HL_out、HL_in、LL)计算得到的形状抽象,以及一份记录各类形状计数的文本文件。 - 子文件夹`Molecules_Abstractions_DBN/Core/`:包含针对每种假结类型(H、HL_out、HL_in、LL)计算得到的核心抽象,以及一份记录各类核心模式计数的文本文件。 - 子文件夹`Molecules_Abstractions_DBN/Core_Plus/`:包含针对每种假结类型(H、HL_out、HL_in、LL)计算得到的核心加抽象,以及一份记录各类核心加模式计数的文本文件。
创建时间:
2025-10-22
二维码
社区交流群
二维码
科研交流群
商业服务