Original model dataset.
收藏NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://figshare.com/articles/dataset/Original_model_dataset_/30098331
下载链接
链接失效反馈官方服务:
资源简介:
Probabilistic Random Forest is an extension of the traditional Random Forest machine learning algorithm that is one of the frequently used machine learning algorithms employed for species distribution modeling. However, with the use of complex dataset for predicting the presence or absence of the species, It is essential that feature extraction is important to generate optimal prediction that can affect the model accuracy and AUC score of the model simulation. In this paper, we integrated the Genetic Algorithm Optimization technique, which is popular for its excellent feature extraction technique, to enhance the predictive performance of the PRF Model. a novel hybrid algorithm the genetically optimized probabilistic random forest algorithm, designed for predicting the distribution of mastomys natalensis in Nigeria. The model was also compared with existing models for dimensionality reduction with other optimization techniques, such as Principal Component Analysis, Grey Wolf, Optimizer optimized backpropagation neural network algorithm (GNNA), Butterfly Optimization Algorithm. These models were evaluated using four performance metrics, accuracy, the areas under curve, sensitivity, specificity, F1_score and precision. We also examined the spatial predictive distribution of the models. The results generated that the predictive performance of PRFGA, significantly improved compared to PRFPCA, GNNA and PRFBOA in predicting the presence or absence of mastomys natalensis with a presence only and pseudo-absence sample set. the PRFGA demonstrated a high predictive power in predicting the spatial distribution of the presence or absence of mastomys natalensis in Nigeria. The integration of the Genetic Algorithm optimization technique, stems from its renowned ability to address the specific challenges of data uncertainty and high-dimensionality reduction in feature extraction sets of SDMs, to enhance the performance of the PRF model.
创建时间:
2025-09-10



