
Summary of tolerated sequence prediction performance on different datasets using the generalized protocol described here.

Source: Figshare. Updated: 2015-12-02. Indexed: 2026-04-29.
Download link:
https://figshare.com/articles/dataset/_Summary_of_tolerated_sequence_prediction_performance_on_different_datasets_using_the_generalized_protocol_described_here_/426279
Resource description:
(1) 16 designed hGH amino acid positions as defined in [23] and shown in Figure 3.
(2) All designed hGH amino acid positions shown in Figure S4.
(3) Performance metrics based on position weight matrices from Smith & Kortemme 2010 [35].

Scoring metrics are used as defined previously [35]. Fraction Top 5 gives the average fraction (over all positions) of amino acids with phage display frequencies ≥10% that appear among the top 5 ranked predicted amino acids. AAD gives the average absolute difference in amino acid frequency between prediction and phage display. AUC gives the area under the receiver operating characteristic curve, with true positives defined as amino acids with phage display frequencies ≥10%. Rank Top gives the average rank, in the prediction, of the amino acid most frequently observed in phage display.

The table gives results from one set of predictions as described in Methods. To gauge the variability, we repeated the predictions three times and calculated the standard deviation of the scoring metrics. The absolute standard deviations and dynamic ranges are 0.4/4.32 (Bits Predicted), 1.9/100 (Fraction Top 5), 0.4/10 (AAD), 0.006/1 (AUC), and 0.2/19 (Rank Top). As a percentage of the dynamic range of a given metric, the average standard deviations (over the first 5 rows) were: 0.9% (Bits Predicted), 1.9% (Fraction Top 5), 0.4% (AAD), 0.6% (AUC), and 1.1% (Rank Top).
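The four metrics defined above can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the array layout (rows are positions, columns are amino acids, values are frequencies between 0 and 1), the pooling of all positions into a single AUC, and the handling of ties are all assumptions made for illustration. Bits Predicted is omitted because it is not defined in this description. Fraction Top 5 is returned as a fraction (multiply by 100 for the percentage scale used in the table), and AAD is returned in percentage points, which matches the stated dynamic range of 10 for a 20-letter alphabet.

```python
import numpy as np

def fraction_top_k(pred, obs, threshold=0.10, k=5):
    """Per position: share of frequently observed amino acids found in the predicted top k."""
    pred, obs = np.asarray(pred, float), np.asarray(obs, float)
    fractions = []
    for p, o in zip(pred, obs):
        frequent = np.flatnonzero(o >= threshold)
        if frequent.size == 0:  # assumption: skip positions with no frequent amino acid
            continue
        top_k = np.argsort(-p, kind="stable")[:k]
        fractions.append(np.isin(frequent, top_k).mean())
    return float(np.mean(fractions))

def average_absolute_difference(pred, obs):
    """Mean |predicted - observed| frequency over all positions and amino acids, in percentage points."""
    pred, obs = np.asarray(pred, float), np.asarray(obs, float)
    return float(100.0 * np.mean(np.abs(pred - obs)))

def pooled_auc(pred, obs, threshold=0.10):
    """ROC AUC with predicted frequency as the score; positives are observed >= threshold.

    Assumption: all positions are pooled; both classes must be non-empty.
    """
    pred, obs = np.asarray(pred, float), np.asarray(obs, float)
    positive = obs >= threshold
    diff = pred[positive][:, None] - pred[~positive][None, :]
    # Probability that a positive outscores a negative, counting ties as one half.
    return float(((diff > 0).sum() + 0.5 * (diff == 0).sum()) / diff.size)

def rank_top(pred, obs):
    """Average predicted rank (1 = best) of the most frequently observed amino acid."""
    pred, obs = np.asarray(pred, float), np.asarray(obs, float)
    ranks = []
    for p, o in zip(pred, obs):
        best = int(np.argmax(o))  # first index wins if observed frequencies tie
        ranks.append(1 + int((p > p[best]).sum()))
    return float(np.mean(ranks))
```

With 20 amino acids per position, `rank_top` ranges from 1 to 20, consistent with the dynamic range of 19 quoted above.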
Created:
2015-12-02