michaelm16/GuideRNA-3B
收藏Hugging Face2024-07-28 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/michaelm16/GuideRNA-3B
下载链接
链接失效反馈官方服务:
资源简介:
GuideRNA-3B是一个大型转录组序列语料库,包含超过37亿对序列,这些序列是从23种细胞系和超过200个RNA病毒的分段基因组中提取的。该数据集旨在建立一个基础模型来表征CRISPR引导RNA靶向区域的多样性,以支持基于CRISPR的RNA病毒无扩增检测和抑制的下游任务。
GuideRNA-3B is a large transcriptome sequence corpus consisting of over 3.7 billion paired sequences extracted from the specific transcriptome of 23 cell lines and over 200 segmented genomes of RNA virus. The dataset aims to establish a foundation model to characterize the manifold of CRISPR guide RNA targeting regions in order to support downstream tasks for universal CRISPR-based RNA virus amplification-free detection and inhibition.
提供机构:
michaelm16



