
From Model Development to Mitigation: Machine Learning for Predicting and Minimizing Iodinated Trihalomethanes in Water Treatment

Source: Figshare · Updated: 2025-06-03 · Indexed: 2026-04-28
Download link:
https://figshare.com/articles/dataset/From_Model_Development_to_Mitigation_Machine_Learning_for_Predicting_and_Minimizing_Iodinated_Trihalomethanes_in_Water_Treatment/29225966
Description:
Disinfection processes in water treatment produce disinfection byproducts (DBPs), such as iodinated trihalomethanes (I-THMs), which pose significant health risks. Mitigating I-THMs remains challenging due to the complex interactions among water quality parameters, disinfectants, and iodine sources, compounded by the difficulty of predicting their formation under varying treatment conditions. This study leverages a data set of 1534 samples from published studies to predict I-THM formation using machine learning (ML). Among five evaluated ensemble models, CatBoost Regression achieved the best performance. Incorporating domain-specific features (iodine/DOC and oxidant/DOC ratios) improved model accuracy and interpretability. Recursive feature elimination revealed that nearly half of the features could be excluded without compromising performance, simplifying model development and reducing experimental effort, an advantage often overlooked in prior research. Feature analysis identified key predictors and mitigation strategies, including minimizing iodine and bromide concentrations, reducing iodine/DOC, UV254 and SUVA levels, and optimizing chlorine dose. The model further enabled rapid identification of the optimal chlorine dose to minimize I-THMs using incremental and Bayesian optimization. Achieving an R² of 0.67 on an external validation data set, the model demonstrated strong generalizability. This study establishes ML as a powerful tool for predicting and mitigating I-THMs, offering actionable strategies for safer drinking water treatment.
Created:
2025-06-03
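
The description mentions two ideas that a short sketch may make more concrete: deriving iodine/DOC and oxidant/DOC ratio features, and stepping through candidate chlorine doses to find the one with the lowest predicted I-THM level (the "incremental" search). The Python below is only an illustration under stated assumptions. The field names (`doc_mg_per_l`, `iodide_ug_per_l`, `chlorine_mg_per_l`), the helper functions, and especially `toy_predictor` are hypothetical; the toy predictor is an arbitrary stand-in for a trained regressor and does not reproduce the study's CatBoost model, its data, or any real chemistry.

```python
from typing import Callable, Dict, Tuple

Sample = Dict[str, float]


def add_ratio_features(sample: Sample) -> Sample:
    """Return a copy of `sample` with iodine/DOC and oxidant/DOC ratios added.

    Field names are hypothetical; adapt them to the columns of the real data set.
    """
    doc = sample["doc_mg_per_l"]
    if doc <= 0:
        raise ValueError("DOC must be positive to form ratio features")
    enriched = dict(sample)
    enriched["iodine_doc_ratio"] = sample["iodide_ug_per_l"] / doc
    enriched["oxidant_doc_ratio"] = sample["chlorine_mg_per_l"] / doc
    return enriched


def toy_predictor(features: Sample) -> float:
    """Hypothetical stand-in for a trained model (NOT the study's model).

    The shape is arbitrary: a bowl in the oxidant/DOC ratio, scaled by the
    iodine/DOC ratio, chosen only so that the search has a clear minimum.
    """
    ratio = features["oxidant_doc_ratio"]
    return features["iodine_doc_ratio"] * (1.0 + (ratio - 1.0) ** 2)


def incremental_dose_search(
    sample: Sample,
    predict: Callable[[Sample], float],
    low: float,
    high: float,
    step: float,
) -> Tuple[float, float]:
    """Evaluate doses from `low` to `high` in increments of `step`.

    Returns (best_dose, best_prediction), where best means lowest predicted value.
    """
    if step <= 0 or high < low:
        raise ValueError("need step > 0 and high >= low")
    n_steps = int(round((high - low) / step))
    best_dose, best_pred = low, float("inf")
    for i in range(n_steps + 1):
        dose = low + i * step
        # Recompute the ratio features for every candidate dose.
        candidate = add_ratio_features({**sample, "chlorine_mg_per_l": dose})
        pred = predict(candidate)
        if pred < best_pred:
            best_dose, best_pred = dose, pred
    return best_dose, best_pred


if __name__ == "__main__":
    water = {"doc_mg_per_l": 2.0, "iodide_ug_per_l": 50.0, "chlorine_mg_per_l": 0.0}
    dose, pred = incremental_dose_search(water, toy_predictor, 0.5, 5.0, 0.5)
    print(f"lowest predicted value at dose {dose} mg/L: {pred}")
```

In practice `predict` would wrap a fitted regressor, and a Bayesian optimizer could replace the fixed-step loop when each model evaluation is expensive or the dose range is wide.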