Using buccal methylomic data to create novel predictors of demographic, health, and lifestyle variables
收藏Mendeley Data2026-04-09 收录
下载链接:
https://data.mendeley.com/datasets/m4zjkxss8f/1
下载链接
链接失效反馈官方服务:
资源简介:
In human blood, it has been demonstrated that methylomic information can be used to predict smoking status, alcohol intake, and chronological age. While it is possible to robustly predict chronological age using DNA methylation information derived from buccal tissue, it remains to be determined if other variables can be directly predicted in cheek swabs. Here, we demonstrate that classifiers for smoking status and race/ethnicity can be built in a buccal methylomic dataset derived from 8,045 adults spanning an age range of 18 – 93 years. Furthermore, we build novel regressors for body mass index, alcohol intake, and chronological age. For each of these models, we identify the 1,000 most important CpGs and perform enrichment analyses on them to expose associated biological pathways and transcription factor targets. Furthermore, we explore how the architecture of an epigenetic aging clock – specifically how many hidden layers are present – influences model accuracy. Finally, we build a proof-of-concept explainable deep learning model that connects DNA methylation sites annotated to genes to Reactome pathways. These pathways are then used to estimate epigenetic age, a feature that inherently provides interpretability. All together, these findings further emphasize the usability of buccal data for epigenetic predictions.



