Open Chromatin Guided Interpretable Machine Learning Reveals Cancer-Specific Chromatin Features in Cell-free DNA
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE279542
下载链接
链接失效反馈官方服务:
资源简介:
Cell-free DNAs (cfDNAs) are DNA fragments found in blood, originating mainly from immune cells in healthy individuals and from both immune and cancer cells in cancer patients. While cancer-derived cfDNAs carry mutations, they also retain epigenetic features such as DNA methylation and nucleosome positioning. In this study, we examine nucleosome enrichment patterns in cfDNAs from breast and pancreatic cancer patients and find significant enrichment at open chromatin regions. Differential enrichment is observed not only at cancer cell type specific ATAC-seq peaks but also at CD4? T cell specific peaks, suggesting both tumor- and immune-derived contributions to the cfDNA signal. To leverage these patterns, we apply an interpretable machine learning model (XGBoost) trained on cell type specific open chromatin regions. This approach improves cancer detection accuracy and highlights key genomic loci associated with the disease state. Our pipeline provides a robust and interpretable framework for cfDNA-based cancer detection. Cell-free DNAs were isolated from patients with breast and pancreatic cancer. Enrichment patterns were analyzed at accessible chromatin regions defined by ATAC-seq. Subsequently, the XGBoost machine learning model was employed to interpret these patterns and identify cancer-specific features. For the Breast_Cancer_cfDNA_2 dataset, additional deep sequencing was performed, and these data have been deposited under the accession Breast_Cancer_cfDNA_2_DeepSeq.
创建时间:
2025-09-08



