bm2-lab/CRISPRviva-3B
Source: Hugging Face. Updated: 2024-10-13. Indexed: 2024-12-14.
Download link:
https://hf-mirror.com/datasets/bm2-lab/CRISPRviva-3B
Description:
CRISPRviva-3B is a large transcriptome sequence corpus of over 3.7 billion sequences, extracted from the transcriptomes of 23 cell lines and the segmented genomes of more than 200 RNA viruses. The corpus is intended for building a foundation model that characterizes the manifold of CRISPR guide RNA targeting regions, which in turn supports downstream tasks in universal CRISPR-based RNA virus detection and inhibition.
Provider:
bm2-lab



