five

Regression tools for chemical release modeling: An additive manufacturing case study

收藏
Taylor & Francis Group2025-01-13 更新2026-04-16 收录
下载链接:
https://tandf.figshare.com/articles/dataset/Regression_tools_for_chemical_release_modeling_An_additive_manufacturing_case_study/28200081/1
下载链接
链接失效反馈
官方服务:
资源简介:
Chemical release data are essential for performing chemical risk assessments to understand the potential exposures arising from industrial processes. Often, these data are unknown or unavailable and must be estimated. A case study of volatile organic compound releases during extrusion-based additive manufacturing is used here to explore the viability of various regression methods for predicting chemical releases to inform chemical assessments. The methods assessed in this work include linear Least Squares, Least Absolute Shrinkage and Selection Operator (LASSO) and Ridge regression, classification and regression tree, random forest model, and neural network analysis. Secondary data describing polymeric extrusion in multiple applications are curated and assembled in a dataset to support regression modeling using default parameters for the various approaches. The potential to add noise to the dataset and improve regression is evaluated using synthetic data generation. Evaluation of model performance for a common test set found all methods were able to achieve predictions within 10%-error for up to 98% of the test sample population. The degree to which this level of performance was maintained when varying the number and type of features for regression was dependent on the model type. Linear methods and neural network analysis predicted the most test samples within 10%-error for smaller numbers of features while tree-based approaches could accommodate a larger number of features. The number and type of features can be important if the desire is to make chemical-specific release predictions. The inclusion of release data from related processes generally improved test set predictions across all models while the use of synthetic data as implemented here resulted in smaller increases in test sample predictions within 10%-error. Future work should focus on improving access to primary data and optimizing models to achieve maximum predictive performance of environmental releases to support chemical risk assessment.
提供机构:
Gonzalez, Michael A.; Meyer, David E.; Barrett, William M.; Smith, Raymond L.; Lanphear, Elizabeth; Takkellapati, Sudhakar; Chea, John D.; Ruiz-Mercado, Gerardo J.
创建时间:
2025-01-13
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作