Enhancers display sequence flexibility constrained by transcription factor motif syntax [Drosophila random variant STARR-seq]

NIAID Data Ecosystem2026-03-14 收录

下载链接：

https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE211655

下载链接

链接失效反馈

官方服务：

资源简介：

The information about when and where each gene is to be expressed is mainly encoded in the DNA sequence of enhancers, sequence elements that comprise binding sites (motifs) for different transcription factors (TFs). Most of the research on enhancer sequences has been focused on TF motif presence, while the enhancer syntax, i.e. the flexibility of important motif positions and how the sequence context modulates the activity of TF motifs, remain poorly understood. Here, we explore the rules of enhancer syntax by a two-pronged approach in Drosophila melanogaster S2 cells: we (1) replace important motifs by an exhaustive set of all possible 65,536 eight-nucleotide-long random sequences and (2) paste eight important TF motif types into 763 motif positions within 496 enhancers. These complementary strategies reveal that enhancers display constrained sequence flexibility and the context-specific modulation of motif function. Important motifs can be functionally replaced by hundreds of sequences constituting several distinct motif types, but only a fraction of all possible sequences and motif types restore enhancer activity. Moreover, TF motifs contribute with different intrinsic strengths that are strongly modulated by the enhancer sequence context (the flanking sequence, presence and diversity of other motif types, and distance between motifs), such that not all motif types can work in all positions. Constrained sequence flexibility and the context-specific modulation of motif function are also hallmarks of human enhancers and TF motifs, as we demonstrate experimentally. Overall, these two general principles of enhancer sequences are important to understand and predict enhancer function during development, evolution and in disease. UMI-STARR-seq was performed in S2 cells using a enhancer libraries where we replace important motifs by an exhaustive set of all possible 65,536 eight-nucleotide-long random sequences. All experiments were performed in 2 biological replicates.

创建时间：

2022-12-15

5,000+

优质数据集

54 个

任务类型

进入经典数据集