A modified GC-specific MAKER gene annotation method reveals improved and novel gene predictions of high and low GC content in Oryza sativa
Source: DataONE. Updated 2020-06-24; indexed 2025-05-03.
Download link:
https://search.dataone.org/view/sha256:c7a6e4d3a1b3af868237395f3837c45dc083c56baf1cfdb3f42f3fb246fde347
Resource description:
Background: Accurate structural annotation depends on well-trained gene prediction programs. Training data for gene prediction programs are often chosen randomly from a subset of high-quality genes that ideally represent the variation found within a genome. One aspect of gene variation is GC content, which differs across species and is bimodal in grass genomes. When gene prediction programs are trained on a subset of grass genes with random GC content, they are effectively being trained on two classes of genes at once, which can be expected to degrade prediction quality when genes are predicted in new genome sequences.
Results: We find that gene prediction programs trained on grass genes with random GC content do not completely predict all grass genes with extreme GC content. We show that gene prediction programs that are trained with grass genes with high or low GC content can make both better and unique gene predictions compared to gene prediction programs that are trained on genes ...
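The idea of separating training genes by GC content can be illustrated with a minimal Python sketch. It is not the authors' MAKER pipeline: the function names `gc_content` and `split_by_gc` and the 0.5 threshold are hypothetical, chosen only to show how a gene set might be partitioned into low-GC and high-GC training subsets.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C among unambiguous bases (A, C, G, T)."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    total = gc + at
    # Sequences with no unambiguous bases (empty or all N) are scored 0.0.
    return gc / total if total else 0.0


def split_by_gc(genes: dict[str, str], threshold: float = 0.5):
    """Partition genes into (low_gc, high_gc) dicts.

    The threshold is an illustrative assumption, not a value from the source.
    """
    low, high = {}, {}
    for name, seq in genes.items():
        target = high if gc_content(seq) >= threshold else low
        target[name] = seq
    return low, high


if __name__ == "__main__":
    # Toy sequences for demonstration only; not real Oryza sativa genes.
    toy = {"geneA": "ATATATGC", "geneB": "GCGCGCAT"}
    low, high = split_by_gc(toy)
    print(sorted(low), sorted(high))
```

Each subset could then be used to train a separate gene predictor, which is the general strategy the abstract describes; in practice the cutoff would be chosen from the observed GC distribution of the genome.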
Created:
2025-04-20



