five

Datasets used in the benchmarking study of MR methods

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/10929571
下载链接
链接失效反馈
官方服务:
资源简介:
We conducted a benchmarking analysis of 16 summary-level data-based MR methods for causal inference with five real-world genetic datasets, focusing on three key aspects: type I error control, the accuracy of causal effect estimates, replicability, and power. The datasets used in the MR benchmarking study can be downloaded here: "dataset-GWASATLAS-negativecontrol.zip":  the GWASATLAS dataset for evaluation of type I error control in confounding scenario (a): Population stratification "dataset-NealeLab-negativecontrol.zip": the Neale Lab dataset for evaluation of type I error control in confounding scenario (a): Population stratification; "dataset-PanUKBB-negativecontrol.zip": the Pan UKBB dataset for evaluation of type I error control in confounding scenario (a): Population stratification; "dataset-Pleiotropy-negativecontrol": the dataset  used for evaluation of type I error control in confounding scenario (b): Pleiotropy; "dataset-familylevelconf-negativecontrol.zip": the dataset used for evaluation of type I error control in confounding scenario (c): Family-level confounders; "dataset_ukb-ukb.zip": the dataset used for evaluation of the accuracy of causal effect estimates; "dataset-LDL-CAD_clumped.zip": the dataset used for evaluation of replicability and power; Each of the datasets contains the following files:  "Tested Trait pairs": the exposure-outcome trait pairs to be analyzed; "MRdat" refers to the summary statistics after performing IV selection (p-value < 5e-05) and PLINK LD clumping with a clumping window size of 1000kb and an r^2 threshold of 0.001. "bg_paras" are the estimated background parameters "Omega" and "C" which will be used for MR estimation in MR-APSS. Note: Supplemental Tables S1-S7.xlxs  provide the download link for the original GWAS summary-level data for the traits used as exposures or outcomes. The formatted dataset after quality control can be accessible at our GitHub website (https://github.com/YangLabHKUST/MRbenchmarking). The details on quality control of GWAS summary statistics, formatting GWASs, and LD clumping for IV selection can be found on the MR-APSS software tutorial on the MR-APSS  website (https://github.com/YangLabHKUST/MR-APSS). R code for running MR methods is also available at https://github.com/YangLabHKUST/MRbenchmarking.
创建时间:
2024-08-04
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作