five

Data for: Variant effect predictions capture some aspects of deep mutational scanning experiments

收藏
NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://data.mendeley.com/datasets/2rwrkp7mfk
下载链接
链接失效反馈
官方服务:
资源简介:
Primary analysis files for bioRxiv manuscript with id 2019/859603 (https://www.biorxiv.org/content/10.1101/859603v1) to evaluate how common variant effect prediction methods capture effect determined by deep mutational scanning experiments. 'data' contains the deep mutational scanning data in a parsed format. See the manuscript for the original data sources which would then be processed with parseRawDatasets.py, followed by manual sequence mapping (resulting in the mapped_seqs.txt files) and then be processed with parseScores.py to result in the .npz files. 'predictionData' contains predictions from SIFT, PolyPhen-2, SNAP2 and Envision, parsed into .npz files. Additional folders are for dummy methods and while executing the below scripts. 'analysis' will contain most of the output files. See below for sample calls to reproduce e.g. Figure 1 from the paper. The scripts are written in Python3 and require, among others, numpy, pandas, scipy, sklearn, rpy2, svgutils and matplotlib. For all scripts the --normalization-scheme flag describes how the experimental scores are processed to fit on the same scale of values. The scheme used for the final manuscript is 'wt0_del_scaled' for deleterious effect variants and 'wt0_ben_scaled' for beneficial effect variants. For compareBinaryDMSToPredictions.py the --binarization-scheme flag describes how scores are binarized to neutral/effect. Possible values are the schemes outlined in the manuscript 'syn90', 'syn95' and 'syn99'.
创建时间:
2020-02-14
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作