five

Benchmark data used in the evaluation of ChiRA tool-suite.

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://zenodo.org/record/4289364
下载链接
链接失效反馈
官方服务:
资源简介:
The reads generated mimic CLASH experimental data. Each read is a fusion of a human hg38 miRBase mature miRNAs and some random TargetScan target sequences. The numbers 10, 12, 15, 18, and 20 in the file names represent the length of the chimeric arms. The are 1million reads in FASTA file. Files with "Insert" in their names contain a short 5 nt random sequence, whereas the "noInsert" files do not.  TargetScanSites_merged.fa.fasta and hg38_MIR_mature.fa.fasta contain the reference sequences. The sequence identifiers in the FASTA file are in the following format: >hsa-miR-193b-5p_MIMAT0004767:1-20||chr7:114656010-114656173+:82-95 where, hsa-miR-193b-5p_MIMAT0004767 is the sequence ID of the miRNA from which first chimeric arm of this read was derived from. This ID can be found in hg38_MIR_mature.fa.fasta. 1-20 represents the 1-based start and end positions on hsa-miR-193b-5p_MIMAT0004767 representing the origin of the first chimeric arm. chr7:114656010-114656173+ is the sequence ID of the TargetScan target site from which the second chimeric arm of this read was derived from. This ID can be found in TargetScanSites_merged.fa.fasta 82-95 represents the 1-based start and end positions on chr7:114656010-114656173+ representing the origin of the second chimeric arm.
创建时间:
2020-11-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作