five

Data from: Gene similarity networks provide new tools for understanding eukaryote origins and evolution

收藏
DataONE2013-05-21 更新2024-06-27 收录
下载链接:
https://search.dataone.org/view/null
下载链接
链接失效反馈
官方服务:
资源简介:
The complexity and depth of the relationships between the three domains of life challenge the reliability of phylogenetic methods, encouraging the use of alternative analytical tools. We reconstructed a gene similarity network comprising the proteomes of 14 eukaryotes, 104 prokaryotes, 2,389 viruses and 1,044 plasmids. This network contains multiple signatures of the chimerical origin of Eukaryotes as a fusion of an archaebacterium and a eubacterium that could not have been observed using phylogenetic trees. A number of connected components (gene sets with stronger similarities than expected by chance) contain pairs of eukaryotic sequences exhibiting no direct detectable similarity. Instead, many eukaryotic sequences were indirectly connected through a “eukaryote–archaebacterium–eubacterium–eukaryote” similarity path. Furthermore, eukaryotic genes highly connected to prokaryotic genes from one domain tend not to be connected to genes from the other prokaryotic domain. Genes of archaebacterial and eubacterial ancestry tend to perform different functions and to act at different subcellular compartments, but in such an intertwined way that suggests an early rather than late integration of both gene repertoires. The archaebacterial repertoire has a similar size in all eukaryotic genomes whereas the number of eubacterium-derived genes is much more variable, suggesting a higher plasticity of this gene repertoire. Consequently, highly reduced eukaryotic genomes contain more genes of archaebacterial than eubacterial affinity. Connected components with prokaryotic and eukaryotic genes tend to include viral and plasmid genes, compatible with a role of gene mobility in the origin of Eukaryotes. Our analyses highlight the power of network approaches to study deep evolutionary events.

生命三域(three domains of life)之间的关系兼具复杂性与深度,这对系统发育分析方法(phylogenetic method)的可靠性提出了挑战,也促使研究者转而采用其他分析工具。本研究构建了涵盖14种真核生物(eukaryote)、104种原核生物(prokaryote)、2389种病毒与1044个质粒(plasmid)的蛋白质组(proteome)基因相似性网络。该网络包含了真核生物作为古细菌(archaebacterium)与真细菌(eubacterium)融合产物的嵌合起源(chimerical origin)的多重特征,而这些特征无法通过系统发育树(phylogenetic tree)观测得到。部分连通分量(connected component,即相似性显著高于随机预期的基因集合)中存在无直接可检测相似性的真核生物序列对;但诸多真核生物序列可通过"真核生物-古细菌-真细菌-真核生物"的相似性路径实现间接连接。此外,与某一类原核生物基因存在高度连接的真核生物基因,通常不会与另一类原核生物基因产生连接。源自古细菌与真细菌祖先的基因,通常行使不同功能并定位在不同的亚细胞区室(subcellular compartment)中,但二者的整合方式高度交织,表明两类基因库(gene repertoire)的整合发生在演化早期而非晚期。所有真核基因组中的古细菌来源基因库规模均较为相近,而真细菌来源基因的数量则差异显著,这表明真细菌来源的基因库具有更高的可塑性。因此,基因组高度简化的真核生物中,古细菌亲缘性基因的数量多于真细菌亲缘性基因。包含原核生物与真核生物基因的连通分量通常也包含病毒与质粒基因,这与基因移动在真核生物起源中发挥作用的假说相符。本研究的分析结果凸显了网络分析方法在研究深层演化事件中的优势。
创建时间:
2013-05-21
二维码
社区交流群
二维码
科研交流群
商业服务