five

Baiting out a full length sequence from unmapped RNA-seq data

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE172487
下载链接
链接失效反馈
官方服务:
资源简介:
Usually, unmapped reads have been considered as useless and been trashed or ignored. Here, we develop a strategy to mining the full length sequence by unmapped reads combining with specific reverse transcription primers design and high throughput sequencing. In this study, we salvage 36 unmapped reads from standard RNA-Seq data(GSM3188619) and randomly select one 149 bp read as a model(CTGGTGCCATAATTCAGGGAACTGTGTTCTTGATGTACTATCTGAGACATTTGTGCTTCCCCCCATCCAGCTATCAGGCTGTTAGGCAATGCACTTCTAGGAATTAGAATTCTATAAGGAATCTCATGCTGGAAGAACAAAAAGACCCA ). Specific reverse transcription primers(5' end:CTGGTGCCATAATTCAGGGA, 3' end:GGATCTTCACGTAACGGATTGT) are designed to amplify its both ends, followed by next generation sequencing. Then we use a statistical model base on power law distribution to estimate its integrality and significance. Further, we validate it by Sanger sequencing. The result shows that the full length is 1,556 bp, with InDel mutation in microsatellite structure. This would be a useful strategy to extract the sequences information from the unmapped RNA-seq data. The library of full length sequence of unmapped reads captured by specific reverse transcription primers
创建时间:
2021-12-10
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作