An effective machine learning model for rat acute oral toxicity prediction of emerging chemicals: multi-domain applications and structure-activity relationships
收藏DataCite Commons2025-07-31 更新2025-09-08 收录
下载链接:
https://tandf.figshare.com/articles/dataset/An_effective_machine_learning_model_for_rat_acute_oral_toxicity_prediction_of_emerging_chemicals_multi-domain_applications_and_structure-activity_relationships/29712744
下载链接
链接失效反馈官方服务:
资源简介:
Given the widespread presence of emerging contaminants in the environment, assessing and ensuring their biosafety is urgent. Under the Globally Harmonized System (GHS), the LD<sub>50</sub> parameter of acute oral toxicity (AOT) is crucial for chemical safety classification. Animal testing limitations have highlighted the need for alternative methods, and machine learning offers a new approach to predict LD<sub>50</sub> through quantitative structure-activity relationship (QSAR) models. This study developed and optimized a machine learning model for LD<sub>50</sub> classification of emerging contaminants based on data from more than 6000 known AOT. Using molecular descriptors and fingerprints, the model achieves an accuracy above 0.86 and a recall score over 0.84, outperforming previous models. The model’s robustness was confirmed across various types of emerging contaminants. Shapley additive explanations (SHAP) identified key descriptors like BCUTp_1h, ATSC1pe, and SLogP_VSA4, while the information gain (IG) method highlighted alert substructures [P-O, P-S]. These findings suggest that compounds with high polarizability, mean electronegativity and significant surface area may adversely affect rats. This model enhances understanding of acute toxicity mechanisms and serves as a tool for early screening of safer compounds, promoting the design of greener chemicals.
提供机构:
Taylor & Francis
创建时间:
2025-07-31



