five

ToxCML: A Hybrid mfCoQ-RASAR-Based Platform Integrating Consensus QSAR and Read-Across for Comprehensive Multi-End Point Toxicity Assessment

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://figshare.com/articles/dataset/ToxCML_A_Hybrid_mfCoQ-RASAR-Based_Platform_Integrating_Consensus_QSAR_and_Read-Across_for_Comprehensive_Multi-End_Point_Toxicity_Assessment/32001438
下载链接
链接失效反馈
官方服务:
资源简介:
Computational in silico methods offer a powerful alternative to animal-based toxicity testing, which remains time-consuming, expensive, and ethically challenging. In practice, QSAR and read-across (RA) are among the most widely explored approaches, yet both suffer from important limitations, including end point-specific modeling, restricted applicability domains, nonquantitative or subjective analogue selection, and sensitivity to biases in chemical space coverage. To address these limitations, this study introduces ToxCML, a large-scale hybrid multifeature consensus quantitative RA Structure–Activity Relationship (mfCoQ-RASAR) platform that unifies consensus QSAR and similarity-based consensus RA into a weight-optimized workflow for predicting 18 toxicity end points across 54,601 unique chemicals. The framework combines multiple molecular representations (MACCS, Morgan, atom pair fingerprints, RDKit fingerprints, and physicochemical descriptors) with machine-learning-based QSAR and k-NN RA models and incorporates tiered applicability-domain analysis and chemical-space mapping that indicate broadly overlapping training-test distributions and in-domain coverage generally exceeding 95% across end points. Across all end points and evaluation settings involving unseen test sets or external validation sets, the hybrid mfCoQ-RASAR models achieve strong discrimination and accuracy (AUC approximately 0.86–0.99; BACC 0.73–0.98), with a consistent performance hierarchy in which mfCoQ-RASAR provides the highest or near-highest performance, consensus QSAR remains highly competitive, and consensus RA, while weaker, still offers informative predictive and analogue-based discriminatory capability. These results indicate that our framework delivers reliable, chemically well-contextualized, and broadly applicable multiend point toxicity predictions for unseen compounds and may support large-scale toxicity screening, hazard prioritization, and efforts to reduce animal testing in regulatory and industrial settings. The ToxCML platform is accessible through a public web server available at http://cardiosim.metaheart.kr:8080/.
创建时间:
2026-04-13
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作