Hyperspectral dataset of historical documents and mock-ups from 400 to 1700 nm (HYPERDOC)
Source: DataCite Commons | Updated: 2025-07-17 | Indexed: 2025-09-08
Download link:
https://springernature.figshare.com/articles/dataset/Hyperspectral_dataset_of_historical_documents_and_mock-ups_from_400_to_1700_nm_HYPERDOC_/28319165
Description:
HYPERDOC is a hyperspectral imaging dataset of historical documents and mock-ups, designed to support research on material identification in the cultural heritage domain. It contains mock-ups of historical inks (metallo-gallate, sepia, carbon-based, and mixtures) on various supports, some of them artificially aged, and historical documents from the 15th to 17th centuries (manuscripts, illuminated manuscripts, and family trees). Hyperspectral reflectance images were acquired with line-scan cameras in the VNIR (400-1000 nm) and SWIR (900-1700 nm) ranges and were spatially registered. Small regions of interest, referred to as 'minicubes', were extracted from the full document images and annotated with pixel-level ground-truth material labels. False-color RGB images and metadata are provided for both the full document captures and the minicubes. The HYPERDOC dataset has been applied in several experimental studies, including ink classification with machine learning models, spectral unmixing, colorimetric analysis, and binarization. These applications illustrate the potential of the dataset, which is publicly available to promote interdisciplinary collaboration and advance the use of hyperspectral imaging in the conservation field.
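To make the minicube idea concrete, the following is a minimal sketch of cropping a small spatial window from a hyperspectral cube and averaging the spectra that share a ground-truth label. Everything in it is an assumption made for illustration: the cube is a tiny synthetic nested list with two bands, the label names (`support`, `ink_a`, `ink_b`) are hypothetical, and the helpers `crop_minicube` and `mean_spectrum_per_label` are not part of HYPERDOC. This listing does not specify the dataset's file format or its loading tools.

```python
from collections import defaultdict


def crop_minicube(image, row0, col0, height, width):
    """Return a spatial window of the image; each pixel keeps all its bands."""
    return [row[col0:col0 + width] for row in image[row0:row0 + height]]


def mean_spectrum_per_label(cube, labels):
    """Average the spectra of all pixels that share a ground-truth label."""
    sums = {}
    counts = defaultdict(int)
    for cube_row, label_row in zip(cube, labels):
        for spectrum, label in zip(cube_row, label_row):
            if label not in sums:
                sums[label] = [0.0] * len(spectrum)
            for band, value in enumerate(spectrum):
                sums[label][band] += value
            counts[label] += 1
    return {
        label: [total / counts[label] for total in band_sums]
        for label, band_sums in sums.items()
    }


# Synthetic 3x3 reflectance image with 2 bands per pixel (illustrative only).
cube = [
    [[0.9, 0.8], [0.9, 0.8], [0.1, 0.6]],
    [[0.9, 0.8], [0.1, 0.6], [0.1, 0.6]],
    [[0.9, 0.8], [0.9, 0.8], [0.3, 0.2]],
]
# Hypothetical pixel-level material labels aligned with the cube.
labels = [
    ["support", "support", "ink_a"],
    ["support", "ink_a", "ink_a"],
    ["support", "support", "ink_b"],
]

# Crop the same 2x2 window from the cube and from its label map.
minicube = crop_minicube(cube, 0, 1, 2, 2)
mini_labels = crop_minicube(labels, 0, 1, 2, 2)

means = mean_spectrum_per_label(minicube, mini_labels)
for label, spectrum in sorted(means.items()):
    print(label, [round(v, 3) for v in spectrum])
```

Cropping the cube and the label map with the same window keeps spectra and annotations aligned, which is the property a pixel-level ground truth depends on. Per-label mean spectra of this kind are a common first step before classification or unmixing experiments.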
Provider:
figshare
Created:
2025-01-30



