Datasets and additional data used in the paper: Michela Quadrini, Emanuela Merelli, and Luca Tesei (2025). “Abstractions for RNA Secondary Structure Analysis.”
Source: NIAID Data Ecosystem (indexed 2026-05-10)
Download link:
https://figshare.com/articles/dataset/Datasets_and_additional_data_used_in_the_paper_Michela_Quadrini_Emanuela_Merelli_and_Luca_Tesei_2025_Abstractions_for_RNA_Secondary_Structure_Analysis_/30418705
Description:
Datasets and Additional Data
This repository contains the datasets and additional data used in the paper:
Michela Quadrini, Emanuela Merelli, and Luca Tesei (2025). “Abstractions for RNA Secondary Structure Analysis.”
### Folder: 5S_Ribosomal_RNA
This folder contains the data used for the experiment on 264 5S ribosomal RNA (5S rRNA) molecules from the CRW2 database.
- Subfolder `Molecules_DBN/`: contains the secondary structures of the 264 RNAs in dot-bracket notation.
- Subfolder `Molecules_Abstractions_DBN/Shape/`: contains the computed Shape abstractions and a text file with the counts of different shapes.
- Subfolder `Molecules_Abstractions_DBN/Core/`: contains the computed Core abstractions and a text file with the counts of different cores.
- Subfolder `Molecules_Abstractions_DBN/Core_Plus/`: contains the computed Core Plus abstractions and a text file with the counts of different core-plus patterns.
### Folder: Pseudoknot_Motifs
This folder contains the data used for the experiment on 332 RNA pseudoknots from the Pseudobase++ database.
#### Molecules in dot-bracket notation
- Subfolder `Molecules_DBN/Pseudoknot_H/`: contains the secondary structures of the 253 RNAs with H-type pseudoknots.
- Subfolder `Molecules_DBN/Pseudoknot_HLout/`: contains the secondary structures of the 64 RNAs with HL_out-type pseudoknots.
- Subfolder `Molecules_DBN/Pseudoknot_HLin/`: contains the secondary structures of the 4 RNAs with HL_in-type pseudoknots.
- Subfolder `Molecules_DBN/Pseudoknot_LL/`: contains the secondary structures of the 11 RNAs with LL-type pseudoknots.
#### Computed abstractions
- Subfolder `Molecules_Abstractions_DBN/Shape/`: contains the computed Shape abstractions and a text file with the counts of different shapes for each pseudoknot type (H, HL_out, HL_in, LL).
- Subfolder `Molecules_Abstractions_DBN/Core/`: contains the computed Core abstractions and a text file with the counts of different cores for each pseudoknot type (H, HL_out, HL_in, LL).
- Subfolder `Molecules_Abstractions_DBN/Core_Plus/`: contains the computed Core Plus abstractions and a text file with the counts of different core-plus patterns for each pseudoknot type (H, HL_out, HL_in, LL).
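The structures in the `Molecules_DBN/` subfolders are stored in dot-bracket notation, and the pseudoknotted ones need more than one bracket type. The following Python sketch shows one way to turn such a string into base pairs. It assumes the common extended convention (`()`, `[]`, `{}`, `<>`, and letter pairs such as `A`/`a`) and that only the structure string is passed in. The exact file layout in the repository is not described here, and the function names `parse_dot_bracket` and `has_pseudoknot` are hypothetical; this is not the authors' code.

```python
def parse_dot_bracket(structure: str) -> list[tuple[int, int]]:
    """Return the sorted 0-based (i, j) base pairs of a dot-bracket string.

    Assumption: extended dot-bracket notation, where each bracket family
    ((), [], {}, <>, Aa, Bb, ...) is nested on its own and different
    families may cross each other to encode pseudoknots.
    """
    closing_of = {"(": ")", "[": "]", "{": "}", "<": ">"}
    # Upper-case letters open a pair, the matching lower-case letter closes it.
    closing_of.update({chr(c): chr(c + 32) for c in range(ord("A"), ord("Z") + 1)})
    opening_of = {close: open_ for open_, close in closing_of.items()}

    stacks = {open_: [] for open_ in closing_of}  # one stack per bracket family
    pairs = []
    for pos, symbol in enumerate(structure):
        if symbol in closing_of:
            stacks[symbol].append(pos)
        elif symbol in opening_of:
            stack = stacks[opening_of[symbol]]
            if not stack:
                raise ValueError(f"unmatched {symbol!r} at position {pos}")
            pairs.append((stack.pop(), pos))
        elif symbol != ".":
            raise ValueError(f"unexpected symbol {symbol!r} at position {pos}")

    unclosed = [open_ for open_, stack in stacks.items() if stack]
    if unclosed:
        raise ValueError(f"unclosed bracket(s): {unclosed}")
    return sorted(pairs)


def has_pseudoknot(pairs: list[tuple[int, int]]) -> bool:
    """True if any two base pairs cross, i.e. i < k < j < l."""
    return any(i < k < j < l for (i, j) in pairs for (k, l) in pairs)


# Illustrative string (not taken from the dataset): two crossing stems.
example = "((..[[..))..]]"
print(parse_dot_bracket(example))                  # [(0, 9), (1, 8), (4, 13), (5, 12)]
print(has_pseudoknot(parse_dot_bracket(example)))  # True
```

Keeping one stack per bracket family is what lets crossing pairs be recovered; a single shared stack would only handle pseudoknot-free structures.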
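After downloading the archive, the molecule counts stated above (264 5S rRNAs, and 253 + 64 + 4 + 11 = 332 pseudoknots) can be checked against the folder contents. The sketch below is a minimal example under two assumptions that the description does not confirm: each molecule is stored as exactly one file in its subfolder, and the subfolders are nested under the two top-level folders as written in the `EXPECTED` keys. The names `EXPECTED`, `count_files`, and `check_counts` are hypothetical.

```python
from pathlib import Path

# Molecule counts as stated in the dataset description.
# Assumption: paths are relative to the extracted archive root.
EXPECTED = {
    "5S_Ribosomal_RNA/Molecules_DBN": 264,
    "Pseudoknot_Motifs/Molecules_DBN/Pseudoknot_H": 253,
    "Pseudoknot_Motifs/Molecules_DBN/Pseudoknot_HLout": 64,
    "Pseudoknot_Motifs/Molecules_DBN/Pseudoknot_HLin": 4,
    "Pseudoknot_Motifs/Molecules_DBN/Pseudoknot_LL": 11,
}


def count_files(folder: Path) -> int:
    """Count regular files directly inside `folder` (assumed one per molecule)."""
    return sum(1 for entry in folder.iterdir() if entry.is_file())


def check_counts(root: Path) -> dict[str, tuple[int, int]]:
    """Return {subfolder: (found, expected)} for every subfolder that differs."""
    mismatches = {}
    for relative, expected in EXPECTED.items():
        folder = root / relative
        found = count_files(folder) if folder.is_dir() else 0
        if found != expected:
            mismatches[relative] = (found, expected)
    return mismatches


if __name__ == "__main__":
    # Replace "." with the directory where the archive was extracted.
    for name, (found, expected) in check_counts(Path(".")).items():
        print(f"{name}: found {found}, expected {expected}")
```

An empty result means every subfolder matches the stated count. A mismatch may simply mean that the one-file-per-molecule assumption does not hold for this archive.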
Created:
2025-10-22



