five

Machine learning-enhanced monitoring of global copper mining areas

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://figshare.com/articles/dataset/Machine_learning-enhanced_monitoring_of_global_copper_mining_areas/28691639
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset provides a comprehensive, site-specific global assessment of land use areas associated with copper mining activities as of 2022. Using machine learning methodologies applied to multispectral remote sensing data, we mapped and classified operational land-use features, including open-cut pits, waste rock dumps, and tailings storage facilities. The dataset covers a total of 1,313 copper mines across 80 countries, encompassing a combined spatial extent of approximately 7,267 km². Observations were made using Sentinel-2 satellite imagery, characterized by high spectral (13 bands, 443–2203 nm) and spatial (10 m) resolution. Parameters measured include spectral reflectance values, Normalized Difference Vegetation Index (NDVI), Normalized Difference Water Index (NDWI), Bare Soil Index (BSI), Enhanced Vegetation Index (EVI), Index-Based Built-up Index (IBI), and topographic information derived from Digital Elevation Models (DEM). Operational data, including historical copper production, production capacity, mining intensity, start and end dates of operations, were integrated from Standard & Poor’s and the US Geological Survey databases. The temporal coverage of the dataset is the year 2022, ensuring temporal consistency and accuracy across global mining areas. Spatial coverage is global, with significant data density in regions such as Canada, Australia, China, the United States, Chile, Peru, and Mexico. The primary purpose of this dataset was to quantify and evaluate the environmental impacts of global copper mining, particularly land use intensity and potential ecological consequences of mining activities. This dataset enables detailed environmental assessments, aids in ecological risk management, supports supply chain sustainability studies, and assists policy-makers and stakeholders in improving resource management and minimizing ecological impacts. Data collection involved preprocessing Sentinel-2 satellite imagery via the Google Earth Engine (GEE) platform, applying cloud-free median composites, and training a Random Forest classification algorithm using manually collected sample points for accurate delineation of mining features. Model validation achieved an overall accuracy of 91.08%. The dataset is provided in vector format, containing polygons annotated with detailed operational attributes, specifically: 1. Name: Copper mine name; 2. Latitude: Latitudinal coordinates of the mine centroid (WGS84, decimal degrees); 3. Longitude: Longitudinal coordinates of the mine centroid (WGS84, decimal degrees); 4. P_C: Primary commodity extracted from the mine; 5. List_of_C: List of secondary commodities extracted; 6. A_S: Activity status of the mine (e.g., Active, Inactive); 7. State/Prov: State or province location of the mine; 8. Country: Country location of the mine; 9. Land_use: Area altered by mining activities (square meters); 10. Cum_Prod: Historical cumulative copper production (metric tons); 11. MI: Mining intensity (100 m²/ton) This structure facilitates easy integration and usability across multiple research and management disciplines. The Google Earth Engine (GEE) script used for remote sensing classification is also included to facilitate replication and further analysis.
创建时间:
2025-07-02
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作