
ESPRESSO: Robust discovery and quantification of transcript isoforms from error-prone long-read RNA-seq data (repository for simulated ONT RNA-seq data)

Source: NIAID Data Ecosystem (indexed 2026-03-14)
Download link:
https://zenodo.org/record/7246436
Resource description:
Simulated ONT direct RNA and 1D cDNA sequencing data at four sequencing depths (0.5 million, 1 million, 3 million, and 5 million simulated reads), used for benchmark evaluations of transcript discovery and quantification in our paper "ESPRESSO: Robust discovery and quantification of transcript isoforms from error-prone long-read RNA-seq data". All details can be found in the Materials and Methods section of the paper.

HEK293T_DirectRNA.transcriptome_quantification.tsv and HEK293T_1DcDNA.transcriptome_quantification.tsv are tab-separated files containing estimated raw read counts and normalized abundance values (in TPM) of transcripts annotated in GENCODE v34lift37. Transcript quantification was done using NanoSim (version 3.1.0).

HEK293T_DirectRNA.NanoSim_500k.fastq.gz, HEK293T_DirectRNA.NanoSim_1M.fastq.gz, HEK293T_DirectRNA.NanoSim_3M.fastq.gz, and HEK293T_DirectRNA.NanoSim_5M.fastq.gz are gzip-compressed FASTQ files containing 0.5 million, 1 million, 3 million, and 5 million simulated ONT direct RNA sequencing reads, respectively.

HEK293T_1DcDNA.NanoSim_500k.fastq.gz, HEK293T_1DcDNA.NanoSim_1M.fastq.gz, HEK293T_1DcDNA.NanoSim_3M.fastq.gz, and HEK293T_1DcDNA.NanoSim_5M.fastq.gz are gzip-compressed FASTQ files containing 0.5 million, 1 million, 3 million, and 5 million simulated ONT 1D cDNA sequencing reads, respectively.
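
After downloading, one quick sanity check is to confirm that a FASTQ file holds roughly the number of reads stated in the description. The following is a minimal sketch, not part of the original record: the helper name count_fastq_reads is hypothetical, and it assumes each record occupies exactly four lines (header, sequence, "+", quality) with no line wrapping, which may not hold for every FASTQ file.

```python
import gzip


def count_fastq_reads(path):
    """Count records in a gzip-compressed FASTQ file.

    Assumption: every record spans exactly 4 lines (no wrapped
    sequence or quality lines).
    """
    line_count = 0
    # "rt" opens the gzip stream in text mode so we iterate over lines.
    with gzip.open(path, "rt") as handle:
        for _ in handle:
            line_count += 1
    if line_count % 4 != 0:
        # A line count that is not a multiple of 4 suggests a truncated
        # download or a file that breaks the 4-line assumption.
        raise ValueError(
            f"{path}: {line_count} lines is not a multiple of 4"
        )
    return line_count // 4
```

For example, count_fastq_reads("HEK293T_DirectRNA.NanoSim_500k.fastq.gz") would be expected to return a value near 500,000 if the file matches the description above. The file is streamed line by line, so memory use should stay small even for the 5-million-read files.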
Created:
2022-10-26