five

FUN-LDA scores for human genome assembly GRCh37

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/7915634
下载链接
链接失效反馈
官方服务:
资源简介:
FUN-LDA is based on a Latent Dirichlet Allocation (LDA) model for predicting functional effects of non-coding genetic variants in a cell type and tissue-specific way by integrating diverse epigenetic annotations for specific cell types and tissues from large-scale genomics projects such as ENCODE and Roadmap Epigenomics. Using this unsupervised approach, we predict tissue-specific functional effects for every position in the human genome for 127 tissues and cell types in ENCODE and Roadmap Epigenomics. This online spreadsheet includes the information about the 127 Roadmap tissues in detail. Format The FUN-LDA scores are stored in the UCSC Genome Browser bigWig Track Format.  To extract FUN-LDA scores, the bigWigAverageOverBed utility is required. It can be downloaded from the Genome Browser website at https://hgdownload.cse.ucsc.edu/admin/exe. User should prepare a UCSC Genome Browser bed file, a tab-separated four-column file. The first column is the chromosome; the second is the zero-based coordinate of the position of interest; the third is that zero-based coordinate plus one; and the fourth is a unique identifier for the position. The command below is an example of extracting FUN-LDA scores in tissue E007 for the positions defined in a bed file, "input.bed". bigWigAverageOverBed E007.valley9.c89.bigwig input.bed output.tab It produces a six-column file, output.tab. The first column includes the unique position id defined in the input bed file. The last column, average over the covered bases, is the score for this position. Reference Daniel Backenroth, Zihuai He, Krzysztof Kiryluk, Valentina Boeva, Lynn Pethukova, Ekta Khurana, Angela Christiano, Joseph Buxbaum, Iuliana Ionita-Laza. FUN-LDA: A latent Dirichlet allocation model for predicting tissue-specific functional effects of noncoding variation: Methods and applications. American Journal of Human Genetics, 2018.
创建时间:
2023-05-10
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作