five

Data Sheet 1_Spatial prediction of ground substrate thickness in shallow mountain area based on machine learning model.pdf

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://figshare.com/articles/dataset/Data_Sheet_1_Spatial_prediction_of_ground_substrate_thickness_in_shallow_mountain_area_based_on_machine_learning_model_pdf/27073009
下载链接
链接失效反馈
官方服务:
资源简介:
IntroductionThe thickness of ground substrate in shallow mountainous areas is a crucial indicator for substrate investigations and a key factor in evaluating substrate quality and function. Reliable data acquisition methods are essential for effective investigation. MethodsThis study utilizes six machine learning algorithms—Gradient Boosting Machine (GB), Random Forest (RF), AdaBoost Regressor (AB), Neural Network (NN), Support Vector Machine (SVM), and k-Nearest Neighbors (kNN)—to predict ground substrate thickness. Grid search optimization was employed to fine-tune model parameters. The models’ performances were evaluated using four metrics: mean squared error (MSE), root mean squared error (RMSE), mean absolute error (MAE), and the coefficient of determination (R2). The optimal parameter combinations for each model were then used to calculate the spatial distribution of ground substrate thickness in the study area. ResultsThe results indicate that after parameter optimization, all models showed significant reductions in the MSE, RMSE, and MAE, while R2 values increased substantially. Under optimal parameters, the RF model achieved an MSE of 1,589, RMSE of 39.8, MAE of 26.5, and an R2 of 0.63, with a Pearson correlation coefficient of 0.80, outperforming the other models. Therefore, parameter tuning is a necessary step in using machine learning models to predict ground substrate thickness, and the performance of all six models improved significantly after tuning. Overall, ensemble learning models provided better predictive performance than other machine learning models, with the RF model demonstrating the best accuracy and robustness. DiscussionMoreover, further attention is required on the characteristics of sample data and environmental variables in machine learning-based predictions.
创建时间:
2024-09-20
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作