five

De novo sequencing and comprehensive analysis of purple sweetpotato (Impomoea batatas L.) transcriptome. Ipomoea batatas

收藏
NIAID Data Ecosystem2026-03-07 收录
下载链接:
https://www.ncbi.nlm.nih.gov/bioproject/PRJNA80119
下载链接
链接失效反馈
官方服务:
资源简介:
High-throughput RNA sequencing (RNA-seq) was performed on fresh tuberous roots of the purple sweetpotato. A total of 25,888,890 high quality reads with 2,330,000,100 nucleotides (nt) and a GC content of 48.92%, were generated from pair-end sequencing. According to de novo assembly by SOAPdenovo, 58,800 unigenes were obtained and ranged from 200 nt to 10,380 nt with an average length of 476 nt. One unigene on average is assembled by 137 reads with maximum reads of 6,242. Using the RPKM (Reads Per Kb per Million reads) method, we estimated that the average expression of one unigene is 34 RPKM with a maximum expression of 1,935 RPKM. BLASTX searches between all unigenes and the database of non-redundant proteins (nr) in NCBI showed that at least 40,280 (68.5%) unigenes were identified to be protein-coding genes. Amongst the 40,280 unigenes, 11,978 and 5,184 genes are homologous to Arabidopsis proteins and rice proteins, respectively. Based on the analysis against the databases of Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG), 19,707 (33.5%) unigenes were classified to 1,807 terms of GO including molecular functions, biological processes, and cellular components, and 9,970 (17.0%) unigenes were enriched to 11,119 KEGG pathways. According to functional analysis of unigenes, we found that at least 3,553 genes may be involved in biosynthesis pathways of starches, alkaloids, anthocyanin pigments, and vitamins. In addition to mining more potential molecular markers for sweetpotato breeding, 851 simple sequence repeats (SSRs, also named microsatellites) were identified in all unigenes.
创建时间:
2011-12-20
二维码
社区交流群
二维码
科研交流群
商业服务