five

Inferring gene function from evolutionary change in signatures of translation efficiency

收藏
NIAID Data Ecosystem2026-03-07 收录
下载链接:
https://figshare.com/articles/dataset/Inferring_gene_function_from_evolutionary_change_in_signatures_of_translation_efficiency/1281372
下载链接
链接失效反馈
官方服务:
资源简介:
Krisko, Copic, Gabaldon, Lehner and Supek. Genome Biol 2014. doi:10.1186/gb-2014-15-3-r44 "Inferring gene function from evolutionary change in signatures of translation efficiency" Additional file 18. All COG gene families found to be significantly enriched/depleted for highly expressed (HE) genes across 911 microbial genomes in any of the 24 examined phenotypes or environments. The HE genes were determined by quantifying the use of optimal codons, which was compared against the composition of intergenic DNA using a randomization test. (Table A) All 200 shown expression-phenotype links have (i) Fisher's exact test for enrichment P<0.01, (ii) magnitude of enrichment >2x or <0.5x, and (iii) Random Forest (RF) randomization test for confounding phenotypes/environments/phylogeny P<0.01. A total of 187 unique COGs are represented in the list. The "numOrgs" column shows a number of organisms positive for the given trait, and the total number of organisms where that trait is defined. For instance, traits named "Mammalian pathogen: (tissue or organ)" are positive for pathogens infecting that organ, negative for all other mammalian pathogens, and undefined for the rest of examined Bacteria and Archaea. In some cases (e.g. halophiles), many microbes have undefined values due to missing annotations. In every COG, column "a" is the number of HE genes in microbes positive for the given trait, "b" is the count of non-HE genes in the same microbes, "c" is HE genes in microbes where the trait is negative and "d" is the count of non-HE genes in the negative organisms. "Enrich" is the enrichment = a/(a+b) / ( c/(c+d) ). (Table B) Same as table A, but with the threshold for the RF test for counfounders slightly relaxed to P<0.05, yielding 167 additional COG-phenotype links (total: 367). (Table C) A set of 2386 inferred COG-phenotype links with broader coverage due to relaxed criteria for magnitude of enrichment (1.5x or 0.66x, instead of 2x or 0.5x); the criteria for significance of the enrichment remained the same (P<0.01 in Fisher's exact test). Additionally, the COG-phenotype links in this table did not have to pass the RF randomization test for confounders (meaning there is no control for phylogeny, genome size/GC content, or the inter-correlation of phenotypes).
创建时间:
2015-01-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作