five

VitralColor-12: A Synthetic Twelve-Color Segmentation Dataset from GPT-Generated Stained-Glass Images

收藏
Mendeley Data2026-04-09 收录
下载链接:
https://data.mendeley.com/datasets/c89n64y2x5/1
下载链接
链接失效反馈
官方服务:
资源简介:
VitralColor-12, a synthetic dataset for color classification and segmentation, utilizes LLMs in specific GPT-5 and DALL·E 3 models to generate images of stained-glass. This approach simplifies labeling by using the dark steel structure supporting the glass as a guide, which provides easy regions to label with a single color per region. After that, we obtain the images and use at least one hand-labelled centroid per color to automatically cluster all pixels based on Euclidean distance and morphological operations, including erosion and dilation with two iterations per operation, and a kernel size of 2 pixels. This process enables us to label a classification dataset and generate segmentation maps automatically. Our dataset comprises 910 images, organized into 70 generated images and 12 pixel segmentation maps—one for each color, which includes 9,509,524 labeled pixels, 1,794,758 of which are unique. We include tables with pixel values in RGB, HSL, CIELAB, and YCbCr color representations, enabling detailed color analysis and training of machine learning algorithms in different color spaces. Furthermore, we also included descriptive statistics and ΔE76, ΔE94, and CIELAB a vs b Chromacity, which prove the distribution, applicability, and realistic perceptual structures, including warm, neutral, and cold colors, as well as the high contrast between black and white colors, offering meaningful perceptual clusters, reinforcing its utility for color segmentation and classification.
提供机构:
Universidad Autonoma de Zacatecas; Universidad Politecnica de Aguascalientes
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作