five

GREEN-DB: Genomic Regulatory Elements ENcyclopedia

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/3981032
下载链接
链接失效反馈
官方服务:
资源简介:
GREEN-DB is a comprehensive collection of 2.4 million regulatory elements in the human genome collected from previously published databases, high-throughput screenings and functional studies. Regulatory regions are classified as enhancers, promoters, silencers, bivalent and information on the controlled gene(s), tissue(s) and associated phenotype(s) are provided for each element when possible. We also calculated a variation constraint metric (range 0-1) for these regulatory regions and showed that genes controlled by constrained regions are enriched for disease-associated genes and essential genes from mouse knock-out screenings. The database also includes information from ENCODE TFBS and DNase peaks; ultra-conserved non-coding elements (UCNE), super-enhancers (dbSuper) and TAD domains (TAD-KB). This release includes 5 files: GREEN-DB_v2.5.db.gz: The full database in SQLite format GRCh37_GREEN-DB.bed.gz[.csi]: A indexed BED file using GRCh37 genome coordinates describing the regulatory regions and associated information useful for variant annotations (controlled genes, closest gene/TSS, constraint metric). GRCh38_GREEN-DB.bed.gz[.csi]: A indexed BED file using GRCh38 genome coordinates describing the regulatory regions and associated information useful for variant annotations (controlled genes, closest gene/TSS, constraint metric). To annotate a VCF file with information from GREEN-DB you can use the bed files and our tool GREEN-VARAN (https://github.com/edg1983/GREEN-VARAN). For more information on the GREEN-DB please refer to our publication (https://doi.org/10.1101/2020.09.17.301960) and to online documentation (https://green-varan.readthedocs.io/en/latest/) GREEN-DB is free to use for academic users, please refer to the attached LICENSE file.   Changes from the previous version: - We fixed an issue with alias symbols conversion that caused a small fraction of region-gene links to point to the wrong gene - Due to the problem above, we removed any region-gene link where the region and the controlled gene were located on different chromosomes - GREEN-DB now includes also TAD domain information from TAD-KB (http://dna.cs.miami.edu/TADKB/) and region-gene interactions are now annotated for occurrence within the same TAD - Better constraint metric model that now takes into account overlap with exonic regions - In addition to the closest gene, an annotation for the closest TSS and its distance is now provided
创建时间:
2024-07-17
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作