Topsoil bulk geochemical compositions - An updated harmonized global dataset
收藏DataCite Commons2026-03-26 更新2026-04-25 收录
下载链接:
https://www.osti.gov/servlets/purl/3024128
下载链接
链接失效反馈官方服务:
资源简介:
Mineral weathering is a key biogeochemical process because of the capacity of minerals to stabilize organic matter. However, predicting soil weathering status across large spatial areas still isn’t possible due to a lack of global data and theoretical frameworks. To address this knowledge gap, multiple global datasets of bulk topsoil geochemical compositions have been harmonized using R. These datasets document topsoil bulk geochemical compositions across five continents (n = ~16,000 observations). Source data for these observations include the EuroGEOSurveys Geochemical Baseline Database (FOREGS), the US Geological Survey National Geochemical Database (NASGLP), the Geochemical Atlas of Australia (GAA), the US Geological Survey Alaska Geochemical Database (AGD84), the National Cooperative Soil Survey (NCSS), the European Geochemical Mapping of Agricultural Soil (GEMAS), Ecorespira-Amazon (ERA), the New Zealand Geochemical Baseline Survey (NZ_GBS), and the African Soil Information Service (AFSIS). Major elements observed include Aluminum (Al), Calcium (Ca), Iron (Fe), Potassium (K), Magnesium (Mg), Sodium (Na), Titanium (Ti), Manganese (Mn), Phosphorus (P), Carbon (C), and Sulfur (S). This data package includes the harmonized dataset itself, and the R scripts necessary to harmonize these datasets, in addition to metadata that describes all columns, files, and databases used in this project.
Methods & Sampling
Step 1 – Databases of geochemical data identified
This study aimed to leverage existing measurements of topsoil geochemical data. Databases were first identified and deemed appropriate for inclusion if they were measuring soils and performed these measurements on the <2mm soil fraction. Databases such as NCSS and AGD84 needed more post processing to include in the database and this was done using the NCSS_datamerge_031626 R file and Alaska_USGSmerge_031626 R file, respectively.
Step 2 – Database harmonization
Once appropriate databases were identified, they were harmonized for ease of analysis using the R script Database_Harmonization_031826. This included removing columns from original datasets that would not be used in analysis (removed columns are noted in the code). Then, data cleaning procedures specific to each dataset were undertaken. This includes standardizing columns to include units and adding metadata columns regarding procedures for analyzing specific elements. Functions for standardizing measurements and units are outline in R files: calculate element_mg_kg_031626, calculate_oxide_wt_perc_031626, change_oxide_caps_031626, and conv_2_numeric_031626. This also included adding a unique identifier for each sample to identify it with its respective database (see CD_ID in data dictionary).
Geographic information:
Data reflect a compilation of datasets collected globally. Geographic areas covered by each of the datasets include:
- EuroGEOSurveys Geochemical Baseline Database (FOREGS) - European continent
- North American Soil Geochemical Landscapes (NASGLP) - continental United States and limited parts of Canada (see database key for more details)
- National Geochemical Survey of Australia (GAA) - Australia
- Alaska geochemical database (AGDB4) - Alaska
- National Cooperative Soil Survey (NCSS) - Global measurements, but concentrated in the continental United States
- Geochemical data for arable land and land under permanent grass cover in continental Europe (GEMAS) - continental Europe
- Ecorespira-Amazon (ERA) - Geochemical data from the Amazon basin
- Geochemical baseline data for New Zealand (NZGBS) - New Zealand
- Geochemical data collected across continental Africa (AfSIS) - Measurements across Africa
提供机构:
Understanding the Geochemical Basis for Soil Organic Matter Storage at the Global Scale
创建时间:
2026-03-23



