five

Molecular Dynamics Fingerprints (MDFP): Machine Learning from MD Data To Predict Free-Energy Differences

收藏
NIAID Data Ecosystem2026-03-10 收录
下载链接:
https://figshare.com/articles/dataset/Molecular_Dynamics_Fingerprints_MDFP_Machine_Learning_from_MD_Data_To_Predict_Free-Energy_Differences/4868894
下载链接
链接失效反馈
官方服务:
资源简介:
While the use of machine-learning (ML) techniques is well established in cheminformatics for the prediction of physicochemical properties and binding affinities, the training of ML models based on data from molecular dynamics (MD) simulations remains largely unexplored. Here, we present a fingerprint termed MDFP which is constructed from the distributions of properties such as potential-energy components, radius of gyration, and solvent-accessible surface area extracted from MD simulations. The corresponding fingerprint elements are the first two statistical moments of the distributions and the median. By considering not only the average but also the spread of the distribution in the fingerprint, some degree of entropic information is encoded. Short MD simulations of the molecules in water (and in vacuum) are used to generate MDFP. These are further combined with simple counts based on the 2D structure of the molecules into MDFP+. The resulting information-rich MDFP+ is used to train ML models for the prediction of solvation free energies in five different solvents (water, octanol, chloroform, hexadecane, and cyclohexane) as well as partition coefficients in octanol/water, hexadecane/water, and cyclohexane/water. The approach is easy to implement and computationally relatively inexpensive. Yet, it performs similarly well compared to more rigorous MD-based free-energy methods such as free-energy perturbation (FEP) as well as end-state methods such as linear interaction energy (LIE), the conductor-like screening model for realistic solvation (COSMO-RS), and the SMx family of solvation models.
创建时间:
2017-04-12
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作