inDecay Training data : processed dataframe + indelgen
收藏DataCite Commons2026-04-10 更新2024-08-19 收录
下载链接:
https://figshare.com/articles/dataset/inDecay_Training_data_processed_dataframe_indelgen/25133564/2
下载链接
链接失效反馈官方服务:
资源简介:
The training data for reimplementing inDecay and FORECasT.The fasta file records the guide RNA, strand, cut-site, and target sequence matched by OligoID.The indelgen folder contains the indelgen file for each OligoID. Each indelgen file records all possible indel events estimated based on the target sequences.Finally, there are five processed dataframe (really big csv). This dataframe contains all the observed events and event frequency.
本数据集为复现inDecay与FORECasT所用的训练数据。其中的FASTA文件记录了通过OligoID匹配得到的向导RNA(guide RNA)、链型、切割位点以及靶序列。indelgen文件夹包含了每个OligoID对应的indelgen文件,每份indelgen文件均记录了基于靶序列估算得到的全部潜在插入缺失事件。本数据集最终包含五个经预处理的数据帧(DataFrame,实为超大型CSV文件),该数据帧囊括了所有已观测到的事件及其发生频率。
提供机构:
figshare
创建时间:
2024-02-04



