five

Accurate age prediction from blood using a small set of DNA methylation sites and a cohort-based machine learning algorithm

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE207605
下载链接
链接失效反馈
官方服务:
资源简介:
Chronological age prediction from DNA methylation sheds light on human aging, indicates poor health and predicts lifespan. Previous studies developed methylation clocks based on linear regression models on methylation array data. While accurate, these models are limited to fixed-rate changes in methylation levels across age. Moreover, the high cost of methylation arrays, compared to targeted-PCR sequencing, hinders widespread utility of such predictors. We present an AI-based alternative termed GP-age, which uses a non-parametric approach based on Gaussian Process Regression of a large cohort of ~12K blood methylomes. Given a new blood sample, methylation levels are compared to the cohort samples, which are then weighted to predict the query age. Using only 30 CpG sites, our approach outperforms state-of-the-art methylation clocks that use hundreds of sites, with a median error of 2.1 years (on held-out data). Our model was also applied to sequencing-based data yielding highly accurate predictions. Overall, we provide an accessible alternative to current array-based methylation clocks, with future applications in aging research, forensic profiling, and monitoring epigenetic processes in transplantation medicine and cancer. Re-published data of Bisulphite converted DNA from 11,910 whole-blood samples hybridised to the Illumina Infinium 450k Human Methylation Beadchip. The processed beta values of age-correlated 2,374 CpG sites and the ages of the donors are included.
创建时间:
2023-10-19
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作