Datasets used in the benchmarking study of MR methods

NIAID Data Ecosystem2026-05-02 收录

下载链接：

https://zenodo.org/record/10929571

下载链接

链接失效反馈

官方服务：

资源简介：

We conducted a benchmarking analysis of 16 summary-level data-based MR methods for causal inference with five real-world genetic datasets, focusing on three key aspects: type I error control, the accuracy of causal effect estimates, replicability, and power. The datasets used in the MR benchmarking study can be downloaded here: "dataset-GWASATLAS-negativecontrol.zip": the GWASATLAS dataset for evaluation of type I error control in confounding scenario (a): Population stratification "dataset-NealeLab-negativecontrol.zip": the Neale Lab dataset for evaluation of type I error control in confounding scenario (a): Population stratification; "dataset-PanUKBB-negativecontrol.zip": the Pan UKBB dataset for evaluation of type I error control in confounding scenario (a): Population stratification; "dataset-Pleiotropy-negativecontrol": the dataset used for evaluation of type I error control in confounding scenario (b): Pleiotropy; "dataset-familylevelconf-negativecontrol.zip": the dataset used for evaluation of type I error control in confounding scenario (c): Family-level confounders; "dataset_ukb-ukb.zip": the dataset used for evaluation of the accuracy of causal effect estimates; "dataset-LDL-CAD_clumped.zip": the dataset used for evaluation of replicability and power; Each of the datasets contains the following files: "Tested Trait pairs": the exposure-outcome trait pairs to be analyzed; "MRdat" refers to the summary statistics after performing IV selection (p-value < 5e-05) and PLINK LD clumping with a clumping window size of 1000kb and an r^2 threshold of 0.001. "bg_paras" are the estimated background parameters "Omega" and "C" which will be used for MR estimation in MR-APSS. Note: Supplemental Tables S1-S7.xlxs provide the download link for the original GWAS summary-level data for the traits used as exposures or outcomes. The formatted dataset after quality control can be accessible at our GitHub website (https://github.com/YangLabHKUST/MRbenchmarking). The details on quality control of GWAS summary statistics, formatting GWASs, and LD clumping for IV selection can be found on the MR-APSS software tutorial on the MR-APSS website (https://github.com/YangLabHKUST/MR-APSS). R code for running MR methods is also available at https://github.com/YangLabHKUST/MRbenchmarking.

创建时间：

2024-08-04

5,000+

优质数据集

54 个

任务类型

进入经典数据集