Enabling Open Machine Learning of Deoxyribonucleic Acid-Encoded Library Selections to Accelerate the Discovery of Small Molecule Protein Binders
收藏Figshare2025-10-06 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Enabling_Open_Machine_Learning_of_Deoxyribonucleic_Acid-Encoded_Library_Selections_to_Accelerate_the_Discovery_of_Small_Molecule_Protein_Binders/30284507
下载链接
链接失效反馈官方服务:
资源简介:
Machine learning (ML) is increasingly used in DNA-encoded library (DEL) screening for ligand discovery, but its success depends on access to suitable data sets, which are often proprietary and costly. To overcome this, we present the first fully open, automated DEL-ML framework using public DEL data sets and chemical fingerprints to enable reproducible, accessible drug discovery. Our workflowfrom model training to virtual screening and compound selectionrequires no human intervention. As a proof of concept, we identified binders for WDR91 by training ML models on the HitGen OpenDEL library (3B molecules) and screening the Enamine REAL Space library (37B molecules), yielding 50 candidates. Experimental testing confirmed seven novel binders with dissociation constants between 2.7–21 μM. Our open-source approach matches the performance of proprietary methods, demonstrating that public DEL data can support robust ML-driven ligand discovery and fostering transparency and broader community participation in drug development.
创建时间:
2025-10-06



