five

Supporting data for "Estimating the total number of phosphoproteins and phosphorylation sites in eukaryotic proteomes."

收藏
DataCite Commons2025-05-26 更新2025-04-15 收录
下载链接:
http://gigadb.org/dataset/100267
下载链接
链接失效反馈
官方服务:
资源简介:
Phosphorylation is the most frequent post-translational modification made to proteins and may regulate protein activity as either a molecular digital switch or a rheostat. Despite the cornucopia of high throughput phosphoproteomic data in the last decade, it remains unclear how many proteins are phosphorylated and how many phosphorylation sites (p-sites) can exist in total within a eukaryotic proteome. We present the first reliable estimates of the total number of phosphoproteins and phosphorylation sites (p-sites), for four eukaryotes (human, mouse, Arabidopsis, and yeast).<br> In all, 188 high-throughput phosphoproteomic datasets were filtered, compiled and studied along with two low-throughput compendia. Estimates of the number of phosphoproteins and p-sites were inferred by two methods: Capture-Recapture, and fitting the saturation curve of cumulative redundant vs. cumulative non-redundant phosphoproteins/p-sites. Estimates were also adjusted for different levels of noise within the individual datasets and other confounding factors. We estimate that in total, 13,000, 11,000 and 3,000 phosphoproteins and 230,000, 156,000 and 40,000 p-sites exist in human, mouse and yeast, respectively, whereas estimates for Arabidopsis were not as reliable. Most of the phosphoproteins have been discovered for human, mouse and yeast, while the dataset for Arabidopsis is still far from complete. The datasets for p-sites are not as close to saturation as those for phosphoproteins, Integration of the low-throughput data suggests that current high-throughput phosphoproteomics is capable of capturing 70-95% of total phosphoproteins, but only 40-60% of total p-sites.
提供机构:
GigaScience Database
创建时间:
2016-12-19
二维码
社区交流群
二维码
科研交流群
商业服务