MAKVEN: A Visible Spectral Dataset for Colour Science and Machine Learning
收藏NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://figshare.com/articles/dataset/MAKVEN_A_Visible_Spectral_Dataset_for_Colour_Science_and_Machine_Learning/30483962
下载链接
链接失效反馈官方服务:
资源简介:
MAKVEN is a spectral dataset covering the visible range (380–780 nm, 5 nm interval). It integrates both measured and reconstructed spectra, designed to provide efficient coverage of the CIE XYZ colour space for applications in colour science, bioinformatics, and machine learning.
The dataset was created to address limitations of existing collections. Classical references such as the Munsell Matt (1269 spectra) and the Macbeth ColourChecker (24 spectra) provide calibration anchors but do not cover the entire colour space. Natural datasets (e.g., Southern Cone, 916 spectra) contribute ecological diversity, while hyperspectral imagery (Foster et al., 2002, Scene 5) adds natural variability but is strongly redundant at the pixel level. To complement these resources, synthetic and reconstructed spectra were generated to fill sparsely represented areas of the colour space, based on spectral reconstruction from a CIELAB grid (Kang, 2006).
All spectra were interpolated to a common support (380–780 nm, 5 nm). Metadata indicate the origin of each spectrum, with the following distribution:
synthetic: 7395 spectrareconst: 1105 spectrarefs/natural: 916 spectranatural: 3118 spectrahiper/natural: 1124 spectraThis combination provides a reproducible, efficient, and accessible resource for training and evaluation of models in colourimetric prediction, luminous transmittance estimation, and portable sensor design. Users are free to define training and testing splits depending on their research goals (e.g., training on reconstructed spectra, validation on measured spectra).
The dataset is released under a Creative Commons Attribution 4.0 International (CC-BY 4.0) license.
创建时间:
2025-10-30



