five

DeepC: Predicting chromatin interactions using megabase scaled deep neural networks and transfer learning (Tiled-C)

收藏
NIAID Data Ecosystem2026-04-25 收录
下载链接:
https://www.ncbi.nlm.nih.gov/sra/SRP221610
下载链接
链接失效反馈
官方服务:
资源简介:
Understanding 3D genome structure requires high throughput, genome-wide approaches. However, assays for all vs. all chromatin interaction mapping are expensive and time consuming, which severely restricts their usage for large-scale mutagenesis screens or for mapping the impact of sequence variants. Computational models sophisticated enough to grasp the determinants of chromatin folding provide a unique window into the functional determinants of 3D genome structure as well as the effects of genome variation. A chromatin interaction predictor should work at the base pair level but also incorporate large-scale genomic context to simultaneously capture the large scale and intricate structures of chromatin architecture. Similarly, to be a flexible and generalisable approach it should also be applicable to data it has not been explicitly trained on. To develop a model with these properties, we designed a deep neuronal network (deepC) that utilizes transfer learning to accurately predict chromatin interactions from DNA sequence at megabase scale. The model generalizes well to unseen chromosomes and works across cell types, Hi-C data resolutions and a range of sequencing depths. DeepC integrates DNA sequence context on an unprecedented scale, bridging the different levels of resolution from base pairs to TADs. We demonstrate how this model allows us to investigate sequence determinants of chromatin folding at genome-wide scale and to predict the importance of regulatory elements and the impact of sequence variations. Overall design: To validate in silico predictions of chromatin interactions at high resolution comparable to Hi-C maps, we performed Tiled-C (Oudelaar and Beagrie et al.2019) over three regions in two cell lines (K562 (WIMM transgenics facility) and GM12878 - LCLs (Coriell)), from which predicted chromatin interactions have been generated. Tiled-C resolves the chromatin interactions over the selected regions at higher resolution and contrast then Hi-C currently can. Library preparation and NG Capture-C was performed in biological triplicates with four unique adapters being used for each replicate to increase sequencing depth and minimize PCR duplicates. These were pooled for maximum resolution. Tiled-C was performed with biotinylated double stranded oligonucleotides targeting sequences adjacent to DpnII sites at tiled restriction fragments over the regions of interest.
创建时间:
2020-10-19
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作