On effective spectral wideband models for clear sky atmospheric emissivity and transmissivity
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/8187968
下载链接
链接失效反馈官方服务:
资源简介:
Overview
The HDF5 file contains primary measurement data and secondary processing data that was used to assess clear sky effective emissivity and transmissivity estimates and generate the results in the associated manuscript (accepted and forthcoming).
Data is indexed by solar time and provided per site for years 2010 through 2015. Sample Python code is provided to reconstruct training and validation sets by concatenating all 'tra' or 'val' samples across sites. Results can be explored by modifying choice of filters and constructing new training and validation sets.
Data usage
The usage of the data presented here is intended for research and development purposes only and implies explicit reference to the paper:Matsunobu, L. M., & Coimbra, C. F. M. (2024). On effective spectral wideband models for clear sky atmospheric emissivity and transmissivity. Journal of Geophysical Research: Atmospheres, 129, e2023JD039798. https://doi.org/10.1029/2023JD039798
Data description
Column names and descriptions are as follows:- dlw_m: measured downwelling longwave [W/m^2]- ghi_m: measured global horizontal irradiance [W/m^2]- dni_m: measured direct normal irradiance [W/m^2]- dhi_m: measured diffuse horizontal irradiance [W/m^2]- rh_m: measured relative humidity [%]- pa_m: measured atmospheric pressure [hPa]- t_m: measured temperature [K]- sza: solar zenith angle [deg]- ghi_c: clear sky global horizontal irradiance [W/m^2]- dni_c: clear sky direct normal irradiance [W/m^2]- dhi_c: clear sky diffuse horizontal irradiance [W/m^2]- cs1: clear sky filter 1- cs2: clear sky filter 2- site_elev: station elevation [m]- clr_pct: fraction of samples identified as clear for the given site and day- clr_num: number of samples identified as clear for the given site and day- pw_hpa: water vapor partial pressure [hPa]- alt_correction: altitude correction- tra: indicate if sample is included in training set- val: indicate if sample is included in validation set- sqrt_pw: square root of non-dimensional water vapor partial pressure- e_sky: effective clear sky emissivity
The last two columns, 'sqrt_pw' and 'e_sky' represent the input and target for linear regression, i.e. e_sky = c_1 + (c_2 * sqrt_pw).Altitude corrected sky emissivity, or expected emissivity for a station at sea-level, is found by e_sky - alt_correction.
Sample code (Python v3.8)
import pandas as pd
site = "GWC" # or other station code
df = pd.read_hdf("data.h5", key=site) # import single site
Training and validation sets can be reconstructed as below. Linear regression on 'sqrt_pw' to predict 'e_sky' - 'alt_correction' in the resultant training set will reproduce results in the associated manuscript.
training = []
validation = []
surfrad_sites = ['BON', 'DRA', 'FPK', 'GWC', 'PSU', 'SXF', 'TBL']
for site in surfrad_sites: # loop through sites
df = pd.read_hdf("data.h5", key=site)
df["site"] = site # add site name
training.append(df.loc[df.tra]) # append samples marked as training
validation.append(df.loc[df.val]) # append samples marked as validation
# join respective set samples across sites
training = pd.concat(training, ignore_index=False)
validation = pd.concat(validation, ignore_index=False)
Reproduce regression results
from sklearn.linear_model import LinearRegression
c1 = 0.6 # set intercept (c1 constant)
x = training.sqrt_pw.to_numpy().reshape(-1, 1)
y = training.e_sky - training.alt_correction - c1 # adjust for altitude and c1
y = y.to_numpy().reshape(-1, 1)
model = LinearRegression(fit_intercept=False)
model.fit(x, y)
c2 = model.coef_[0][0]
print(f"c1={c1:.3f}, c2={c2:.3f}")
# output: c1=0.600, c2=1.652
创建时间:
2024-03-22



