five

Novel Methods for Prioritizing “Close-In” Analogs from Structure–Activity Relationship Matrices

收藏
NIAID Data Ecosystem2026-03-10 收录
下载链接:
https://figshare.com/articles/dataset/Novel_Methods_for_Prioritizing_Close-In_Analogs_from_Structure_Activity_Relationship_Matrices/5203528
下载链接
链接失效反馈
官方服务:
资源简介:
Here we describe the development of novel methods for compound evaluation and prioritization based on the structure–activity relationship matrix (SARM) framework. The SARM data structure allows automatic and exhaustive extraction of SAR patterns from data sets and their organization into a chemically intuitive scaffold/functional-group format. While SARMs have been used in the retrospective analysis of SAR discontinuity and identifying underexplored regions of chemistry space, there have been only a few attempts to apply SARMs prospectively in the prioritization of “close-in” analogs. In this work, three new ways of prioritizing virtual compounds based on SARMs are described: (1) matrix pattern-based prioritization, (2) similarity weighted, matrix pattern-based prioritization, and (3) analysis of variance based prioritization (ANV). All of these methods yielded high predictive power for six benchmark data sets (prediction accuracy R2 range from 0.63 to 0.82), yielding confidence in their application to new design ideas. In particular, the ANV method outperformed the previously reported SARM based method for five out of the six data sets tested. The impact of various SARM parameters were investigated and the reasons why SARM-based compound prioritization methods provide higher predictive power are discussed.
创建时间:
2017-07-13
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作