five

Accelerating Rational Crystal Habit Design via Interpretable Mechanism-Guided Machine Learning Framework

收藏
Figshare2026-02-13 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Accelerating_Rational_Crystal_Habit_Design_via_Interpretable_Mechanism-Guided_Machine_Learning_Framework/31331903
下载链接
链接失效反馈
官方服务:
资源简介:
The pharmaceutical crystal habit significantly influences downstream processing and product quality. However, traditional experimental screening methods rely heavily on empirical knowledge and trial-and-error experimentation, demanding substantial resources and time. Here we present an interpretable machine learning framework integrating molecular descriptors with quantum chemical (QC) or thermodynamic (HSP, PC-SAFT) parameters to predict crystal habits. Using a curated database of 418 entries covering 153 APIs and 41 solvents, the XGBoost model demonstrated superior performance among seven algorithms. Models augmented with QC or thermodynamic descriptors significantly improved prediction, achieving AUC > 0.88. SHAP analysis revealed dipole moments and solubility parameter differences as key determinants of crystal habit, enhancing interpretability. External validation on six compounds in seven solvents confirmed that models with additional mechanistic descriptors consistently outperformed the molecular-only baseline, with the Molecular+QC model showing the best overall performance. This proposal paves the way to the rational design of crystal engineering strategies for habit prediction and control, and offers a general framework that can readily be extended to other crystallographic applications, as well as broader domains, including drug discovery and materials science.
创建时间:
2026-02-13
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作