Deciphering Regulatory SNPs from ATAC-seq
收藏NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE145245
下载链接
链接失效反馈官方服务:
资源简介:
Background: Expression quantitative trait loci (eQTL) studies are a valuable approach for identifying genetic variants correlated with gene expression. However, identifying the causal variants is challenging due to linkage disequilibrium amongst variants in the same haplotype block. In this study, we aim to identify functional SNPs in key regulatory regions that alter transcriptional regulation and thus, potentially impact cellular function. The majority of disease-associated single-nucleotide polymorphisms (SNPs) are located in regulatory regions, which can result in allele-specific binding (ASB) of transcription factors and differential expression of the target gene alleles. Here, we present regSNPs-ASB, a generalized linear model-based approach to accurately identify regulatory SNPs that are located in transcription factor binding sites from ATAC-seq data. Results: Using regSNPs-ASB, we identified 53 regulatory SNPs in human MCF-7 breast cancer cells and 125 regulatory SNPs in human mesenchymal stem cells (MSC). By integrating the regSNPs-ASB output with RNA-seq experimental data and publicly available chromatin interaction data from MCF-7 cells, we found that these 53 regulatory SNPs were associated with 74 potential target genes and that 32 (43%) of these genes showed significant allele-specific expression (ASE). By comparing all of the MCF-7 and MSC regulatory SNPs to the eQTLs in the Genome-Tissue Expression (GTEx) Project database, we found that 30% (16/53) of the regulatory SNPs in MCF-7 and 43% (52/122) of the regulatory SNPs in MSC were also eQTLs. The enrichment of regulatory SNPs in eQTLs indicated that many of them are likely responsible for allelic differences in gene expression (chi-square test, p-value < 0.01). In sum, we conclude that regSNPs-ASB is a useful tool for identifying causal variants from ATAC-seq data. This new computational tool will enable efficient prioritization of genetic variants identified as eQTL for further studies to validate their causal regulatory function. Ultimately, identifying causal genetic variants will further our understanding of the underlying molecular mechanisms of disease and the eventual development of potential therapeutic targets. ATAC-seq assays were performed on two human cell lines, MCF-7 breast cancer cells and MSC (Mesenchymal Stem Cells). We generated 3 ATAC-seq libraries in MCF-7 cells and each individual libraries were derived from three technical replicates. We also generated 3 ATAC-seq libraries in MSC and each individual libraries were derived from two technical replicates. RNA-seq assays were performed on MCF-7 breast cancer cells. We generated 2 RNA-seq libraries which were each sequenced in three lanes to create three technical replicates per library.
创建时间:
2020-02-27



