five

Shale lithofacies identification based on heterogeneous ensemble learning in Hongxing area

收藏
中国科学数据2026-04-02 更新2026-04-25 收录
下载链接:
https://www.sciengine.com/AA/doi/10.12431/issn.1000-1441.2025.0055
下载链接
链接失效反馈
官方服务:
资源简介:
The enrichment degree of shale oil and gas in the Hongxing area is highly correlated with lithofacies characteristics, which serve as a core indicator for shale reservoir evaluation in the region and directly affect the efficiency and accuracy of oil and gas exploration and development; to address the identification challenges of scarce lithofacies label samples and complex overlapping conventional logging response characteristics of shales in the Hongxing area, this study proposes a heterogeneous ensemble learning model integrating three machine learning methods: Bayesian classification, BP neural network, and random forest. Firstly, the model establishes a refined "3-endmember 4-component" lithofacies classification scheme based on mineral three-endmembers (siliceous, argillaceous, calcareous) and total organic carbon (TOC) content, accurately classifying the shales in the study area into five types: carbon-rich hybrid shales, carbon-rich siliceous shales, high-carbon hybrid shales, high-carbon siliceous shales, and medium-carbon calcareous shales. Subsequently, a Stacking ensemble framework is adopted, generating lithofacies prediction probability matrices of the three base learners (Bayesian classification, BP neural network, and random forest) through K-fold cross-validation, and optimizing weight allocation via an adaptive probability fusion mechanism to synergize the adaptability of Bayesian classification to scarce samples, the nonlinear fitting capability of BP neural network for complex features, and the strong stability of random forest, thereby realizing direct identification and rule characterization of target lithofacies based on conventional logging curves. Application of the model to shale lithofacies identification in Well HY1-4, a typical well in the Hongxing area, yields results showing that the model achieves a comprehensive identification accuracy of 93.19%, representing an average effective improvement of 14.85% compared with single methods such as Bayesian classification, BP neural network, and random forest; it maintains excellent robustness under extreme exploration scenarios including scarce samples, significant data noise, and unbalanced class distribution, and demonstrates favorable spatial generalization ability through cross-regional validation in Well FY10 of the Fuxing area.
创建时间:
2026-03-27
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作