five

Training data for de novo transcriptome reconstruction from RNA-seq data

收藏
NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://zenodo.org/records/583140
下载链接
链接失效反馈
官方服务:
资源简介:
The data provided here are part of a Galaxy Training Network tutorial that analyzes RNA-seq data using a de novo transcriptome reconstruction strategy from a study published by Wu et al., 2014 (DOI:10.1101/gr.164830.113). The goal of this study was to investigate "the dynamics of occupancy and the role in gene regulation of the transcription factor Tal1, a critical regulator of hematopoiesis, at multiple stages of hematopoietic differentiation." To this end, RNA-seq libraries were constructed from multiple mouse cell types including G1E - a GATA-null immortalized cell line derived from targeted disruption of GATA-1 in mouse embryonic stem cells - and megakaryocytes. This RNA-seq data was used to determine differential gene expression between G1E and megakaryocytes and later correlated with Tal1 occupancy. This dataset (GEO Accession: GSE51338) consists of biological replicate, paired-end, polyA selected RNA-seq libraries. Because of the long processing time for the large original files, we have downsampled the original raw data files to include only reads that align to a subset of interesting genomic loci identified by Wu et al. This dataset represents an even smaller set of data than another training data set (DOI:10.5281/zenodo.254485).
创建时间:
2020-01-24
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作