five

High resolution annotation of Zebrafish transcriptome using long-read sequencing

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE101843
下载链接
链接失效反馈
官方服务:
资源简介:
With the emergence of zebrafish as an important model organism, a concerted effort has been made to study its transcriptome. This effort is limited by gaps in zebrafish annotation, which is especially pronounced concerning transcripts dynamically expressed during zygotic genome activation (ZGA). To date, short read sequencing has been the principal technology for zebrafish transcriptome annotation. In part because these sequence reads are too short for assembly methods to resolve the full complexity of the transcriptome, the current annotation is rudimentary. By providing direct observation of full-length transcripts, recently refined long-read sequencing platforms can dramatically improve annotation coverage and accuracy. Here, we leveraged the SMRT platform to study the early ZGA-stage zebrafish transcriptome. Our analysis revealed additional novelty and complexity in the zebrafish transcriptome, identifying 2748 high confidence novel transcripts that originated from previously unannotated loci and 1835 new isoforms in previously annotated genes. Pooled RNA of α-amanitin / untreated embryos were collected and profiled with long-read sequencing. Temporally corresponding pre/post ZGA pooled embryonic RNA samples were profiled with short-read RNA-seq. Long-read raw data were assembled into transcripts using IsoSeq  (PMID: 27407110), mapped to the reference GRCz10 genome using GMAP [PMID:15728110] and annotated against the reference transcriptome using Cuffcompare [PMC3334321]. Novel transcripts were compared to short-read data and computationally validated in constructing a final long-read augmented transcriptome.
创建时间:
2021-07-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作