
Learning the Sequence Determinants of Alternative Splicing from Millions of Random Sequences

Source: NIAID Data Ecosystem, indexed 2026-03-11
Download link:
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE74070
Description:
Most human transcripts are alternatively spliced, and many disease-causing mutations affect RNA splicing. To better model the sequence determinants of alternative splicing, we measured the splicing patterns of nearly 2 million (M) synthetic mini-genes, which include degenerate subsequences totaling nearly 100M bases of variation. The massive size of this training data set allowed us to improve upon current models of splicing and to gain new mechanistic insights. Our results show that the vast majority of hexamer sequence motifs measurably influence splice site selection when positioned within alternative exons, with multiple motifs acting additively rather than cooperatively. Intriguingly, motifs that enhance (suppress) exon inclusion in alternative 5' splicing also enhance (suppress) exon inclusion in alternative 3' or cassette exon splicing, suggesting a universal mechanism for alternative exon recognition. Finally, our empirically trained models are highly predictive of the effects of naturally occurring variants on alternative splicing in vivo.

HEK293 cells were transfected with two alternatively spliced plasmid libraries. Spliced reads were sequenced to determine isoform counts for each library sequence.
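The description says that hexamer motifs act additively and that each library sequence comes with isoform counts. As a rough illustration only, an additive hexamer model could be sketched as below. This is not the authors' model: the function names, the logistic link, the pseudocount, and the toy weights are all assumptions made for the example, and the weights are invented rather than taken from the data set.

```python
import math
from collections import Counter


def hexamer_counts(seq, k=6):
    """Count every overlapping k-mer (hexamer by default) in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def additive_logit(seq, weights, bias=0.0):
    """Additive model: each hexamer occurrence adds its own weight.

    Hexamers missing from `weights` contribute nothing, and there are
    no interaction terms between motifs.
    """
    counts = hexamer_counts(seq)
    return bias + sum(weights.get(kmer, 0.0) * n for kmer, n in counts.items())


def inclusion_probability(seq, weights, bias=0.0):
    """Map the additive score to a predicted inclusion fraction (assumed logistic link)."""
    return 1.0 / (1.0 + math.exp(-additive_logit(seq, weights, bias)))


def observed_inclusion(included_reads, skipped_reads, pseudocount=1.0):
    """Estimate the inclusion fraction from isoform read counts.

    The pseudocount (an assumption) keeps low-coverage sequences away from 0 or 1.
    """
    total = included_reads + skipped_reads + 2 * pseudocount
    return (included_reads + pseudocount) / total


# Hypothetical weights, invented for illustration only.
toy_weights = {"GAAGAA": 0.8, "TAGGGT": -0.9}
toy_seq = "ACGAAGAAGAACTT"

score = additive_logit(toy_seq, toy_weights)
predicted = inclusion_probability(toy_seq, toy_weights)
measured = observed_inclusion(included_reads=30, skipped_reads=10)
```

In a real analysis the weights would be fitted to the measured isoform counts, for example by regressing the observed inclusion fractions on the hexamer counts of each library sequence.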
Created:
2019-05-15