five

File S1 - Genome-Wide Identification of Chromatin Transitional Regions Reveals Diverse Mechanisms Defining the Boundary of Facultative Heterochromatin

收藏
NIAID Data Ecosystem2026-03-07 收录
下载链接:
https://figshare.com/articles/dataset/_Genome_Wide_Identification_of_Chromatin_Transitional_Regions_Reveals_Diverse_Mechanisms_Defining_the_Boundary_of_Facultative_Heterochromatin_/735652
下载链接
链接失效反馈
官方服务:
资源简介:
Figures S1–S8 and Tables S1–S2. Figure S1. Construction of empirical positive and negative evaluation datasets. For the optimization of the parameters, we set up the empirical positive and negative evaluation datasets for CTR. For the positive dataset, we took the genomic loci which, (1) show clear H3K27me3 enrichment transition (H3K27me3 tags in the 4 kb region on the enriched and depleted side should be greater than 1.5 fold or less than 25% of genome average, respectively), (2) has a single enrichment region for one of the three known insulator proteins Su(Hw), dCTCF, and BEAF-32 and respective co-factors (CP190 and Mod(mdg4) for Su(Hw), and CP190 for dCTCF and BEAF-32) in the vicinity of the transition. With these criteria, we got 81 positive loci, among which 66, 8 and 7 are bound by BEAF-32, dCTCF and Su(Hw) respectively. For CTR-negative dataset, we used the following criteria, (1) a gene with corresponding genomic region greater than 5 kb and does not overlap with any other gene(s), and (2) the gene should have no alternative splicing, and (3) the average H3K27me3 ChIP-Seq tags in the transcribed region of the gene should be more than two-fold of the whole genome average. We scanned all the annotated genes, 159 of them went through this unbiased selection for S2 cells. To avoid possible chromatin barriers at TSS (transcription start site) or TES (transcription end site), we took the central 60% regions of each gene to form the negative dataset. Sample regions in positive and negative evaluation datasets. (A) A sample positive region on chromosome 3R. The region has enrichment of DNA-binding insulator protein Su(Hw) and co-factors CP190 and Mod(mdg4), while it does not have enrichment of other two DNA-binding insulator proteins (dCTCF, BEAF-32). In addition, there is dramatic transition in terms of H3K27me3 signal around this region. The dashed line indicates the midpoint of Su(Hw) enriched region, and it was also taken as the coordinate of this positive region. (B) A sample negative region on chromosome 2L. The entire transcribed region of the gene CG4341 is covered by high level of H3K27me3. The central 60% of this gene (the region between two dashed lines) was taken as a negative region. Figure S2. Comparison of CTRICS with SICER and RSEG. (A) The Venn diagram shows the number of CTRs (predicted by CTRICS) that are overlapping with the chromatin boundaries predicted by the other two methods in S2 cells. (B) False-positive and false-negative rates for different methods based on the empirical evaluation datasets. (C) A region shows several examples of CTRs predicted by CTRICS (red bar), H3K27me3 boundaries predicted by RSEG (grey bar), and H3K27me3 domains predicted by SICER. RSEG missed the three boundaries shown in the blue dashed block, which can be corroborated with RNA-Seq and H3K4me3 ChIP-chip data. Figure S3. Sequencing depth analysis. In order to test if the H3K27me3 ChIP-Seq dataset has reached saturation status and if sequencing depth has any influence on the CTR prediction, we conducted the sequencing depth analysis. We first randomly extracted a series of subsamples (10%, 20%, 30%, and so on until 90% of the original tags) from the H3K27me3 ChIP-Seq dataset without replacement. We then identified chromatin transitional regions in each subsample using CTRICS with default parameters. The x-axis of the plot represents the percentage of subsample tags compared to the total tags (∼2.8 x106), and y-axis indicates the number of CTRs identified. Figure S4. Principal component analysis of CTRs based on association with the 15 proteins. (A) Percentage of total variance accounted for by individual principal components. (B) Two-dimensional projections onto the first three principal components. Different colors of the dots represent different groups of CTRs corresponding to the groups shown in the hierarchical clustering result (Figure 3A). Figure S5. Genomic distribution of CTRs. The average intensities of RNA polymerase II (A) and H3K4me3 (B) around individual groups of CTR. (C) The distribution of CTRs in each group. Figure S6. An example of 2 CTRs (red bars) predicted by CTRICS in human HeLa cells. The panel below CTR shows H3K27me3 domains predicted in Cuddapah et al. 2009. CTRICS identifies the boundaries with a significant drop of H3K27me3 level, but ignores the boundaries with minor changes in H3K27me3 signal. Figure S7. Binding patterns of co-factors are different for CTR-associated and euchromatic binding sites. Binding patterns of insulator proteins (A) and their co-factors (B) around CTR-associated (solid curve), heterochromatic (dotted curve) and euchromatic (break curve) binding sites in Drosophila S2 cells. For CTR-associated binding sites, negative and positive distances denote euchromatic and heterochromatic side. Figure S8. Flowchart of CTRICS. The green characters are the parameters needed in each step. Table S1. The list of ChIP-Chip profiles used in the clustering analysis. Table S2. The list of datasets used in this study. (DOC)
创建时间:
2013-06-28
二维码
社区交流群
二维码
科研交流群
商业服务