five

Single cell Iso-Sequencing enables rapid genome annotation for scRNAseq analysis

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
http://datadryad.org/dataset/doi%253A10.5061%252Fdryad.0k6djhb1x
下载链接
链接失效反馈
官方服务:
资源简介:
Single cell RNA sequencing (scRNAseq) is a powerful technique that continues to expand across various biological applications. However, incomplete 3’ UTR annotations can impede single cell analysis resulting in genes that are partially or completely uncounted. Performing scRNAseq with incomplete 3’ UTR annotations can hinder the identification of cell identities and gene expression patterns and lead to erroneous biological inferences. We demonstrate that performing single cell isoform sequencing (ScISOr-Seq) in tandem with scRNAseq can rapidly improve 3' UTR annotations. Using threespine stickleback fish (Gasterosteus aculeatus), we show that gene models resulting from a minimal embryonic ScISOr-Seq dataset retained 26.1% greater scRNAseq reads than gene models from Ensembl alone. Furthermore, pooling our ScISOr-Seq isoforms with a previously published adult bulk Iso-Seq dataset from stickleback, and merging the annotation with the Ensembl gene models, resulted in a marginal improvement (+0.8%) over the ScISOr-Seq only dataset. In addition, isoforms identified by ScISOr-Seq included thousands of new splicing variants. The improved gene models obtained using ScISOr-Seq lead to successful identification of cell types and increased the reads identified of many genes in our scRNAseq stickleback dataset. Our work illuminates ScISOr-Seq as a cost-effective and efficient mechanism to rapidly annotate genomes for scRNAseq. Methods This dataset originates from an experiment where 70hpf embryos were dissociated into single cells then captured by the 10X Single Cell Genomics 3' Genome Expression mRNA-Seq prep with v3.1 NextGem chemistry. After capture, the library was split into two libraries, one was sequenced using illumina's NovaSeq 6000 and the other was sequenced by Pacbio Sequel 2. The single cell ISO sequencing (ScISOrSeq) was processed using PacBio's SMRT Analysis software, custom scripts (https://github.com/hopehealey/scISOseq_processing), cDNA cupcake, and SQANTI3. Additional sequencing data (from Naftaly, Pau, and White 2021) was also processed with PacBio's SMRT Analysis software, cDNA cupcake, and SQANTI3. The produced annotation file was merged with the stickleback annotation (BROAD S1: 104.1 database version, downloaded from Ensembl) using TAMA. The new annotations were tested with Cell Ranger to see how well they captured the generated stickleback scRNAseq reads.
创建时间:
2022-02-28
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作