Enabling Open Machine Learning of Deoxyribonucleic Acid-Encoded Library Selections to Accelerate the Discovery of Small Molecule Protein Binders
收藏NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://figshare.com/articles/dataset/Enabling_Open_Machine_Learning_of_Deoxyribonucleic_Acid-Encoded_Library_Selections_to_Accelerate_the_Discovery_of_Small_Molecule_Protein_Binders/30284507
下载链接
链接失效反馈官方服务:
资源简介:
Machine learning
(ML) is increasingly used in DNA-encoded library
(DEL) screening for ligand discovery, but its success depends on access
to suitable data sets, which are often proprietary and costly. To
overcome this, we present the first fully open, automated DEL-ML framework
using public DEL data sets and chemical fingerprints to enable reproducible,
accessible drug discovery. Our workflowfrom model training
to virtual screening and compound selectionrequires no human
intervention. As a proof of concept, we identified binders for WDR91
by training ML models on the HitGen OpenDEL library (3B molecules)
and screening the Enamine REAL Space library (37B molecules), yielding
50 candidates. Experimental testing confirmed seven novel binders
with dissociation constants between 2.7–21 μM. Our open-source
approach matches the performance of proprietary methods, demonstrating
that public DEL data can support robust ML-driven ligand discovery
and fostering transparency and broader community participation in
drug development.
创建时间:
2025-10-06



