five

Species from Hidden patterns of codon usage bias across kingdoms

收藏
The Royal Society Figshare2024-02-15 更新2026-04-17 收录
下载链接:
https://rs.figshare.com/articles/dataset/Species_from_Hidden_patterns_of_codon_usage_bias_across_kingdoms/11794185/1
下载链接
链接失效反馈
官方服务:
资源简介:
The genetic code is necessarily degenerate with 64 possible nucleotide triplets being translated into 20 amino acids. 18 out of the 20 amino acids are encoded by multiple synonymous codons. While synonymous codons are clearly equivalent in terms of the information they carry, it is now well established that they are used in a biased fashion. There is currently no consensus as to the origin of this bias. Drawing on ideas from stochastic thermodynamics we derive from first principles a mathematical model describing the statistics of codon usage bias. We show that the model accurately describes the distribution of codon usage bias of genomes in the fungal and bacterial kingdoms. Based on it, we derive a new computational measure of codon usage bias—the distance D capturing two aspects of codon usage bias: (i) Differences in the genome-wide frequency of codons and (ii) apparent non-random distributions of codons across mRNAs. By means of large scale computational analysis of over 900 species across two kingdoms of life, we demonstrate that our measure provides novel biological insights. Specifically, we show that while codon usage bias is clearly based on heritable traits and closely related species show similar degrees of bias, there is considerable variation in the magnitude of D within taxonomic classes suggesting that the contribution of sequence-level selection to codon bias varies substantially within relatively confined taxonomic groups. Interestingly, commonly used model organisms are near the median for values of D for their taxonomic class, suggesting that they may not be good representative models for species with more extreme D, which comprise organisms of medical an agricultural interest. We also demonstrate that amino acid specific patterns of codon usage are themselves quite variable between branches of the tree of life, and that some of this variability correlates with organismal tRNA content.
提供机构:
Kalfon, Jeremie; de Lima Hedayioglu, Fabio; Deng, Yun; von der Haar, Tobias; Chu, Dominique
创建时间:
2020-02-03
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作