five

Powerful SNP Set Analysis for Case-Control GenomeWide Association Studies

收藏
NIAID Data Ecosystem2026-03-06 收录
下载链接:
https://doi.org/10.7910/DVN/DIL3MU
下载链接
链接失效反馈
官方服务:
资源简介:
Genome wide association studies (GWAS) have emerged as popular tools for identifying genetic variants that are associated with disease risk. Standard analysis of a case-control GWAS involves assessing the association between each individual genotyped SNP and disease risk. However, this approach suffers from limited reproducibility and difficulties in detecting multi-SNP and epistatic effects. As an alternative analytical strategy, we propose grouping SNPs together into SNP sets based on proximity to genomic features such as genes or haplotype blocks, and then testing the joint effect of each SNP set. Testing of each SNP set proceeds via the logistic kernel machine based test which is based on a statistical framework that allows for flexible modeling of epistatic and nonlinear SNP effects. This flexibility as well as the ability to naturally adjust for covariate effects are important features of our test that make it appealing compared to individual SNP tests and existing multi-marker tests. Using simulated data based on the International HapMap Project, we show that SNP set testing can have improved power over standard individual SNP analysis under a wide range of settings. In particular, we find that our approach has higher power than individual SNP analysis when the median correlation between disease susceptibility variant and the genotyped SNPs is moderate to high. When the correlation is low, both individual SNP analysis and the SNP set analysis tend to have low power. We apply SNP set analysis to analyze the CGEMS breast cancer GWAS discovery phase data.
创建时间:
2010-10-13
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作