CSEM-MISD - CSEM's Multi-Illumination Surface Defect Detection Dataset|机器视觉数据集|表面缺陷检测数据集

Mendeley Data2024-05-10 更新2024-06-30 收录

机器视觉

表面缺陷检测

下载链接：

https://zenodo.org/records/7410513

下载链接

链接失效反馈

资源简介：

In automated surface visual inspection, it is often necessary to capture the inspected part under many different illumination conditions to capture all the defects. To address this issue, at CSEM we have acquired a real-world multi-illumination defect segmentation dataset, called CSEM-MISD and we release it for research purposes to benefit the community. The dataset consists of three different types of metallic parts -- washers, screws, and gears. Parts were captured in a half-spherical light-dome system that filtered out all the ambient light and successively illuminated it from 108 distinct illumination angles. Each 12 illumination angles share the same elevation level and the relative azimuthal difference between the adjacent illumination angles on the same level is 30 degrees. For more details, please read Sections 3 and 4 of our paper. The washers dataset features 70 defective parts. The gears and screws datasets feature 35 defective, 35 intact and several hundred unannotated parts. Some defects, such as notches and holes, are visible in most images (illuminations) with intensity and texture variations among them, while others, such as scratches, are only visible in a few. We split the datasets into train and test sets. The train sets contain 32 samples, and the test set 38 samples. Each sample comprises 108 images (each captured under a different illumination angle), an automatically extracted foreground segmentation mask, and a hand-labeled defect segmentation mask. This dataset is challenging mainly because: each raw sample consists of 108 gray-scale images of resolution 512×512 and therefore takes 27MB of space; the metallic surfaces produce many specular reflections that sometimes saturate the camera sensors; the annotations are not very precise because the exact extent of defect contours is always subjective; the defects are very sparse also in the spatial dimensions: they cover only about 0.2% of the total image area in gears, 0.8% in screws, and 1.4% in washers; this creates an unbalanced dataset with a highly skewed class representation. The dataset is organized as follows: each sample resides in the Test, Train, or Unannotated directory; each sample has its own directory which contains the individual images, the foreground, and defect segmentation masks; each image is stored in 8-bit greyscale png format and has a resolution of 512 x 512 pixels; Image file names are formatted using three string fields separated with the underscore character: prefix_sampleNr_illuminationNr.png, where the prefix is e.g. washer, the sampleNr might be a three-digit number 001, and the illuminationNr is formed of 3 digits, first corresponding to the elevation index (1 - highest angle, 9 - lowest angle), and the additional two corresponding to the azimuth index (01-12). Each dataset contains light_vectors.csv, which contains the illumination angles (in lexicographic order of the illuminationNr), and light_intensities.csv that contains the numbers corresponding to the light intensity on the scale from 0 to 127. Please, be aware, that the azimuth angles were not calibrated and might be a few degrees misaligned. We provide data loaders implemented in python at the project's repository. If you find our dataset useful, please cite our paper: Honzátko, D., Türetken, E., Bigdeli, S. A., Dunbar, L. A., & Fua, P. (2021). Defect segmentation for multi-illumination quality control systems. Machine vision and Applications.

创建时间：

2023-06-28

用户留言

有没有相关的论文或文献参考？

这个数据集是基于什么背景创建的？

数据集的作者是谁？

能帮我联系到这个数据集的作者吗？

这个数据集如何下载？

点击留言

数据主题

具身智能

数据集 4098个

机构 8个

大模型

数据集 439个

机构 10个

无人机

数据集 37个

机构 6个

指令微调

数据集 36个

机构 6个

蛋白质结构

数据集 50个

机构 8个

空间智能

数据集 21个

机构 5个

5,000+

优质数据集

54 个

任务类型

进入经典数据集

热门数据集

TCIA

TCIA（The Cancer Imaging Archive）是一个公开的癌症影像数据集，包含多种癌症类型的医学影像数据，如CT、MRI、PET等。这些数据通常与临床和病理信息相结合，用于癌症研究和临床试验。

www.cancerimagingarchive.net 收录

中国农村教育发展报告

该数据集包含了中国农村教育发展的相关数据，涵盖了教育资源分布、教育质量、学生表现等多个方面的信息。

www.moe.gov.cn 收录

stochastic/random_streetview_images_pano_v0.0.2

随机街景图像数据集是从randomstreetview.com抓取的带有标签的全景图像。每张图像显示一个可以通过Google Street View访问的位置，这些图像被大致组合以提供单个位置的约360度视角。该数据集的设计目的是仅基于其视觉内容对图像进行地理定位。数据集包含约10,000张图像，涵盖了55个国家的约175张照片，主要集中在欧洲和亚洲。

hugging_face 收录

Pet Disease images

Comprehensive Image Dataset for Detecting Pet Diseases Across Multiple Species

kaggle 收录

Club Football Match Data (2000 - 2025)

该数据集提供了一个简单的入口，用于分析全球27个国家和42个联赛的足球比赛数据，包括英超、德甲和西甲等顶级联赛。数据涵盖了从2000/01赛季到2024/25赛季的最新比赛结果。数据集还包括Elo评分，每月的1号和15号对欧洲约500支最佳球队进行快照。

github 收录