five

Combining High-Throughput Screening and In Silico Modeling to Derisk Novel Agrochemicals for Androgen Receptor Binding

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://figshare.com/articles/dataset/Combining_High-Throughput_Screening_and_In_Silico_Modeling_to_Derisk_Novel_Agrochemicals_for_Androgen_Receptor_Binding/31698490
下载链接
链接失效反馈
官方服务:
资源简介:
Androgen receptor (AR) modulation is a critical safety concern for environmental chemicals, including agrochemicals, due to its role in endocrine disruption. Existing public data sets for AR modulation are limited in size and diversity. Here, we report a large-scale high-throughput screening (HTS) campaign assessing AR binding for over 72,000 compounds from an agrochemical library using a fluorescence polarization displacement assay. Confirmatory dose–response testing identified 4,183 AR binders (5.7% hit rate) with substantial structural diversity and numerous novel scaffolds. To enable predictive modeling, we curated an unrestricted data set of 24,953 compounds with associated activity data, which we publish as part of this paper. Using this data set, we trained machine learning models based on molecular 1D and 2D descriptors and fingerprints. Gradient-boosted trees achieved the best performance, with a balanced accuracy of 0.77 and a negative predictive value of 0.98, making the model suitable for derisking large virtual libraries. External validation on existing publicly available AR data sets (CoMPARA and PubChem) demonstrated reasonable transferability (balanced accuracy 0.66 and 0.72), overcoming differences in experimental methods and composition of the compound sets. Our findings demonstrate the utility of combining HTS with machine learning for early safety assessment and provide a benchmark data set to advance AR binding prediction, complementing existing data sets
创建时间:
2026-03-13
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作