Searching for RNA genes using base-composition statistics

Name: Searching for RNA genes using base-composition statistics
Creator: Oxford University Press
Published: 2002-05-01 00:00:00
License: 暂无描述

PubMed Central2002-05-01 更新2026-05-16 收录

下载链接：

https://pmc.ncbi.nlm.nih.gov/articles/PMC113829/

下载链接

链接失效反馈

官方服务：

资源简介：

The hypothesis that genomic regions rich in non-protein-coding RNAs (ncRNAs) can be identified using local variations in single-base and dinucleotide statistics has been investigated. (G+C)%, (G–C)% difference, (A–T)% difference and dinucleotide-frequency statistics were compared among seven classes of ncRNAs and three genomes. Significant variations were observed in (G+C)% and, in Methanococcus jannaschii, in the frequency of the dinucleotide ‘CG’. Screening programs based on these two base-composition statistics were developed. With (G+C)% screening alone, a 1% fraction of the M.jannaschii genome containing all 44 known transfer RNAs, ribosomal RNAs and signal recognition particle RNAs could be identified. When (G+C)% combined with CG dinucleotide-frequency screening was used, 43 of the 44 known M.jannaschii structural ncRNAs were again identified, while the number of presumably false hits overlapping a known or putative protein-coding gene was reduced from 15 to 6. In addition, 19 candidate ncRNAs were identified including one with significant homology to several known archaeal RNaseP RNAs.

提供机构：

Oxford University Press

创建时间：

2002-05-01

5,000+

优质数据集

54 个

任务类型

进入经典数据集