five

DICTrank Is a Reliable Dataset for Cardiotoxicity Prediction Using Machine Learning Methods

收藏
Figshare2025-03-27 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/DICTrank_Is_a_Reliable_Dataset_for_Cardiotoxicity_Prediction_Using_Machine_Learning_Methods/28678454
下载链接
链接失效反馈
官方服务:
资源简介:
Drug-induced cardiotoxicity (DICT) is a significant challenge in drug development and public health. DICT can arise from various mechanisms; New Approach Methods (NAMs), including quantitative structure–activity relationships (QSARs), have been extensively developed to predict DICT based solely on individual mechanisms (e.g., hERG-related cardiotoxicity) due to the availability of datasets limited to specific mechanisms. While these efforts have significantly contributed to our understanding of cardiotoxicity, DICT assessment remains challenging, suggesting that approaches focusing on isolated mechanisms may not provide a comprehensive evaluation. To address this, we previously developed DICTrank, the largest dataset for assessing overall cardiotoxicity liability in humans based on FDA drug labels. In this study, we evaluated the utility of DICTrank for QSAR modeling using five machine learning methodsLogistic Regression (LR), K-Nearest Neighbors, Support Vector Machines, Random Forest (RF), and extreme gradient boosting (XGBoost)which vary in algorithmic complexity and explainability. To reflect real-world scenarios, models were trained on drugs approved before and within 2005 to predict the DICT risk of those approved thereafter. While we observed no clear association between prediction performance and model complexity, LR and XGBoost achieved the best results with DICTrank. Additionally, our significant-feature analyses with RF and XGBoost models provided novel insights into DICT mechanisms, revealing that drug properties associated with descriptors such as “structural and topological”, “polarizability”, and “electronegativity” contributed significantly to DICT. Moreover, we found that model performance varied by therapeutic category, suggesting the need to tailor models accordingly. In conclusion, our study demonstrated the robustness and reliability of DICTrank for cardiotoxicity prediction in humans using machine learning methods.
创建时间:
2025-03-27
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作