Variant data.

Figshare | Updated 2025-09-05 | Indexed 2026-04-28

Download link:

https://figshare.com/articles/dataset/Variant_data_/30065760

Description:

Data used to calculate per-variant Z-scores, to select escape variants, and to compute per-variant normalized fitness scores. Rows correspond to variants, labeled by mutation or, for synonymous wild-type variants, by DNA sequence. Columns are labeled by treatment: "activity" refers to PLpro activity after doxycycline induction, "leaky" refers to activity in the absence of doxycycline, and "PF", "Jun" and "WEHI" refer to PLpro activity in the presence of PF-07957472, Jun12682 and WEHI-P8, respectively.

The sheet labeled “Raw Data - Z-Score calculation” outlines how raw DiMSum fitness scores were converted to Z-scores and gives the mean read count of the input condition (FRET+) and the DiMSum error estimates. It also includes GISAID observations and whether each variant is accessible by a single base pair mutation.

The sheet labeled “Confident escape variants” shows how putative escape variants were filtered by mean read count (must be over 10) and DiMSum error (must be below 1 for the activity and leaky errors and below 0.8 for the inhibitor treatments). Variants with Z-scores over 2 (2 SD above wild type) were considered escape variants. The “conclusion” column records whether a variant is categorized as escaping a particular treatment (escape), susceptible to it (sensitive), or undetermined because of poor data quality (fail). This sheet also includes GISAID observations and whether each variant is accessible by a single base pair mutation.

The sheet labeled “Normalized fitness and SE” contains normalized DiMSum fitness scores by variant, calculated from the raw fitness scores found in the sheet labeled “Raw data - Z-score calculation”. (XLSX)
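The filtering rules described above can be sketched in Python. This is only an illustration of one reading of the thresholds, not the authors' actual workflow: the function names, the argument layout, and the choice to compute the Z-score against the mean and sample standard deviation of the synonymous wild-type fitness values are all assumptions, and the spreadsheet itself should be consulted for the exact calculation.

```python
from statistics import mean, stdev

# Thresholds quoted in the dataset description.
MIN_MEAN_READS = 10          # mean input read count must be over 10
MAX_ERROR_BASELINE = 1.0     # "activity" and "leaky" errors must be below 1
MAX_ERROR_INHIBITOR = 0.8    # inhibitor-treatment errors must be below 0.8
ESCAPE_Z = 2.0               # Z-score over 2 counts as escape

INHIBITORS = {"PF", "Jun", "WEHI"}


def z_score(fitness, wildtype_fitness):
    """Z-score of one fitness value relative to wild-type values.

    Assumption: wild-type mean and sample SD define the reference
    distribution; the description only says "2 SD above wildtype".
    """
    return (fitness - mean(wildtype_fitness)) / stdev(wildtype_fitness)


def classify(z, mean_reads, error, treatment):
    """Return 'escape', 'sensitive', or 'fail' for one variant and treatment.

    Assumption: only the error of the treatment being classified is checked.
    """
    max_error = MAX_ERROR_INHIBITOR if treatment in INHIBITORS else MAX_ERROR_BASELINE
    if mean_reads <= MIN_MEAN_READS or error >= max_error:
        return "fail"  # undetermined due to poor data quality
    return "escape" if z > ESCAPE_Z else "sensitive"


if __name__ == "__main__":
    # Made-up numbers, purely to show the call shape.
    print(classify(z=2.5, mean_reads=50, error=0.3, treatment="PF"))
```

Under this reading, a variant with a high Z-score is still labeled "fail" when its read count or error estimate misses the quality cut, which matches the three values described for the “conclusion” column.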

Created:

2025-09-05
