five

IMSGC Genome Wide Association Study of Multiple Sclerosis

收藏
NIAID Data Ecosystem2026-05-16 收录
下载链接:
https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000139.v1.p1
下载链接
链接失效反馈
官方服务:
资源简介:
The goal of this study is to perform a comprehensive allelic and genotypic association analysis of the entire human genome in multiple sclerosis. The recent definitive linkage genome screen demonstrated that there is no other MS risk gene with an effect size anywhere near that of the MHC. However, linkage analysis is significantly hampered by reduced power in the face of heterogeneity and requires multiplex families, which also hampers acquiring an appropriate sample size. In contrast, genotyping 500K SNPs allows us to survey a significant amount of the genome (we estimate >70%) directly for allelic or genotypic association. This uses the improved power of association analysis and can also take advantage of the linkage disequilibrium relationships among SNPs to further increase power (e.g. haplotype analysis). Quality control and data analysis are significant challenges. We will initially perform substantial QC checks and analyze the data using both TDT and AFBAC approaches. Multigenic interactions will also be tested using MDR.]]> The Multiple Sclerosis International Genetics Consortium Diagnostic Criteria FormThe International Multiple Sclerosis Genetics Consortium (IMSGC) Requirements and Procedures for Data SubmissionQuality Control Sample and SNP filtering was done by investigators.TDT association analysis was replicated in pre-compute at NCBI. Quality control A total of 1,003 parent-affected child trios were initially identified for genotyping. After tests for DNA quality and quantity and Mendelian errors (using the Sequenom fingerprint SNP results), a total of 960 trios were genotyped using both Affymetrix chips. Results due to systematic errors and families with false paternity were removed. Preliminary analysis uncovered a relationship between the BRLMM-generated quality score (this score is the output for each SNP genotype call by the BRLMM program), and genotypes with Mendelian errors ( < 1% of genotypes had a quality score > 0.3 as defined in http://www.broad.mit.edu/gen_analysis/genotyping/brlmm_affy_ncrr.html, but they contributed over 20% of the genotypes with Mendelian errors). Therefore, any genotype with a BRLMM quality score > 0.3 was removed. The results then underwent quality control procedures implemented by a software package (WASP; Whole-genome Association Study Pipeline) that included the examination of SNP and sample genotyping efficiency, allele frequencies, checks of Mendelian errors, and sex-assignment discrepancies (based on available chromosome X genotypes), and tests of Hardy-Weinberg Equilibrium. We used the Graphical Representation of Relationships (GRR) program7 to test for relationships between samples and to detect sample switches, duplications, or contamination. Potential population stratification was examined as follows. From the 334,923 SNPs, we chose 1000 (with > 1Mb physical spacing) that had maximal chi-square differences in allele frequency between HapMap populations (500 for the CEU/YRI comparison, and 500 for the CEU/CHB+JPT comparison). The analysis International Multiple Sclerosis Genetic Consortium, N Engl J Med 2007;3576 (using the STRUCTURE program8) was seeded with data from unrelated individuals (60 CEU, 60 YRI, 45 CHB, 45 JPT). No significant stratification was observed. Further, we used the results of these analyses to determine if any individual appeared to be of non-European descent. We removed any individual who had a probability of < 90%, as given by the Structure program, of having all European-derived parents and grandparents. This procedure removed 34, 21, and 15 individuals from the screening, NIMH, and WTCCC datasets, respectively. All samples were examined for exclusion using the final SNP dataset (334,923 SNPs). These analyses identified 29 trios with problematic data leaving a final set of 931 trios for our primary screen. SNP exclusion criteria To advance to the whole-genome association study, SNPs had to pass the following quality control criteria: not mapping to multiple locations in the genome (3,605 SNPs excluded); a > 95% genotype call rate in the total 2,880 (960 trio) samples (58,435 SNPs excluded); allele frequency (MAF) > 0.005 in the parents (59,887 SNPs excluded); Hardy Weinberg equilibrium with P > 1 x 10-4 in parents (16,881 SNPs excluded); > 99% genotype call rate if the SNP had a MAF <10% (due to platform-specific observations demonstrating a difficulty in calling rare alleles) (74,492 SNPs excluded); and <10 total Mendelian errors (below the 95% cut-off assuming a binomial distribution for a 0.5% error rate) (13,136 SNPs excluded). In total, 334,923 SNPs survived these exclusion criteria and were included in the analysis. ]]>An MS specialist confirmed an MS diagnosis for collection site participants in this study. The diagnosis of MS was made according to the McDonald criteria after review of all available clinical data. Complete clinical data was available for over 95.0% of patients. Disease course was recorded for patients at entry to study as either relapsing-remitting (RR), secondary progressive (SP), primary progressive (PP), or relapsing-progressive (PR) 4. Disability was also assessed at entry with the Expanded Disability Status Scale (EDSS) 5. SPMS was characterized by six months of worsening neurological disability not explained by relapse and measured as a deterioration of either: i) one or more points on the EDSS in patients with an EDSS less than 6, or ii) a one-half point or more for those with an EDSS > 6. PPMS was characterized both by 1) progressive clinical worsening for more than 12 months from symptom onset without any relapses, and 2) abnormal cerebrospinal fluid (CSF) as defined by the presence of two or more oligoclonal bands or elevated IgG index in the CSF. If acute relapses were superimposed on this steadily progressive course, patients were considered to have PRMS. CIS was characterized as the first, well-defined neurological event lasting more than 48 hours, and which involved the optic nerve, spinal cord, brainstem, or cerebellum. In patients presenting with a CIS, the presence of two or more hyper-intense lesions on a T2-weighted MRI sequence was also required for enrollment into the study. A small number of subjects with clinically isolated syndrome (CIS) were included among the US samples used for this study. For the purpose of the analysis, they were treated as cases of "relapsing-remitting" cases of MS since they represent one tail of the distribution of the demyelinating disease spectrum and a majority of them will go on to have a second attack or show evidence of disease progression.]]>
创建时间:
2009-07-20
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作