Data for: Variant effect predictions capture some aspects of deep mutational scanning experiments
收藏NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://data.mendeley.com/datasets/2rwrkp7mfk
下载链接
链接失效反馈官方服务:
资源简介:
Primary analysis files for bioRxiv manuscript with id 2019/859603 (https://www.biorxiv.org/content/10.1101/859603v1) to evaluate how common variant effect prediction methods capture effect determined by deep mutational scanning experiments.
'data' contains the deep mutational scanning data in a parsed format. See the manuscript for the original data sources which would then be processed with parseRawDatasets.py, followed by manual sequence mapping (resulting in the mapped_seqs.txt files) and then be processed with parseScores.py to result in the .npz files.
'predictionData' contains predictions from SIFT, PolyPhen-2, SNAP2 and Envision, parsed into .npz files. Additional folders are for dummy methods and while executing the below scripts.
'analysis' will contain most of the output files.
See below for sample calls to reproduce e.g. Figure 1 from the paper. The scripts are written in Python3 and require, among others, numpy, pandas, scipy, sklearn, rpy2, svgutils and matplotlib.
For all scripts the --normalization-scheme flag describes how the experimental scores are processed to fit on the same scale of values. The scheme used for the final manuscript is 'wt0_del_scaled' for deleterious effect variants and 'wt0_ben_scaled' for beneficial effect variants.
For compareBinaryDMSToPredictions.py the --binarization-scheme flag describes how scores are binarized to neutral/effect. Possible values are the schemes outlined in the manuscript 'syn90', 'syn95' and 'syn99'.
创建时间:
2020-02-14



