five

MDD-Molecular Dynamics Dataset: Collection of protein-ligand complex simulations

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/11172814
下载链接
链接失效反馈
官方服务:
资源简介:
Dataset is part of the paper: https://chemrxiv.org/engage/chemrxiv/article-details/664c73f6418a5379b0de8152. This dataset consists of molecular dynamics (MD) simulations of 862 unique protein-ligand complexes, covering a wide range of protein families and diverse chemical classes of ligands. It is derived from publicly available repositories and represents the largest single source of MD simulations to date. All protein-ligand complexes included in the dataset were prepared following a standardized protocol. Missing atoms in the protein structures were added using the PDBFixer tool. The protein targets were parameterized using the AMBER99SB-ILDN force field, while ligands were parameterized with the ANTECHAMBER module within the ACPYPE tool. Ligand partial charges were determined to match the quantum-mechanically generated electrostatic potential via the Restrained Electrostatic Potential (RESP) method, and the remaining parameters were set using the GAFF2 force field. The molecular dynamics simulations were performed using GROMACS. The simulations were configured in a cubic simulation box with periodic boundary conditions and employed a TIP3P water model within an electrostatically neutral environment. The simulation protocol included an initial minimization cycle, followed by temperature equilibration in the NVT ensemble and pressure equilibration in the NPT ensemble. Production simulations were conducted over a period of 200 ns, with a timestep of 100 ps. Constructing a large, representative set of MD simulations poses challenges due to the high computational costs and complexities associated with preparing molecular systems. Moreover, given the limited number of suitable training examples (complexes) and the large volume of MD data from each simulation, careful filtering and feature selection are crucial. This dataset is valuable for exploring how molecular dynamics simulation data can be integrated with protein-ligand binding affinity prediction tasks, an essential component of in silico drug discovery pipelines. MD simulations, in particular, offer a dynamic view by illustrating the temporal interactions within protein-ligand complexes, potentially providing additional insights for affinity and specificity estimates.
创建时间:
2024-05-24
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作