WSI-Babel-Shark: Empty Whole-Slide Images for Slide-Label Metadata Extraction

Name: WSI-Babel-Shark: Empty Whole-Slide Images for Slide-Label Metadata Extraction
Creator: heiDATA
Published: 2025-12-17 12:56:52
License: No description available

Source: DataCite Commons. Updated: 2025-12-17. Indexed: 2026-05-07.

Download link:

https://heidata.uni-heidelberg.de/citation?persistentId=doi:10.11588/DATA/ZBS9RS

Description:

This dataset contains 22 whole-slide image (WSI) files in SVS format, digitized using a Leica GT450 scanner. All WSIs were intentionally scanned without tissue; only the physical slide labels are present. The purpose of this dataset is to support the evaluation and benchmarking of the WSI-Babel-Shark metadata-extraction pipeline. Empty slides allow reduced file sizes, preservation of SVS metadata, and controlled conditions for benchmarking label-processing components, including OCR, DataMatrix decoding, stain parsing, SlideID reconstruction, and metadata harmonization. All WSIs retain full TIFF tiling, SVS headers, and Leica metadata. Files were manually inspected to ensure complete de-identification, and all CaseIDs and SlideIDs represent synthetic test cases. A ground-truth CSV file containing validated metadata fields is included for benchmarking. No patient-identifying information is contained in any file.
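As an illustration of how the ground-truth CSV might be used for benchmarking, the following minimal sketch computes per-field exact-match accuracy between pipeline output and ground truth. The column names (`filename`, `case_id`, `slide_id`, `stain`) and the sample rows are hypothetical; the actual CSV schema is not described here and should be checked against the file shipped with the dataset.

```python
import csv
import io


def field_accuracy(truth_csv, pred_csv, key="filename"):
    """Return {field: fraction of slides whose predicted value matches ground truth}.

    Both arguments are CSV text. `key` is the column that identifies a slide;
    its name is an assumption, not the dataset's documented schema.
    """
    truth = {row[key]: row for row in csv.DictReader(io.StringIO(truth_csv))}
    preds = {row[key]: row for row in csv.DictReader(io.StringIO(pred_csv))}
    if not truth:
        return {}
    # Every ground-truth column except the key is treated as a field to score.
    fields = [f for f in next(iter(truth.values())) if f != key]
    correct = {f: 0 for f in fields}
    for name, expected in truth.items():
        got = preds.get(name, {})  # a missing slide counts as wrong for all fields
        for f in fields:
            if (got.get(f) or "").strip() == (expected[f] or "").strip():
                correct[f] += 1
    return {f: correct[f] / len(truth) for f in fields}


# Hypothetical sample data; the real ground-truth columns may differ.
TRUTH = """filename,case_id,slide_id,stain
slide_01.svs,CASE-0001,CASE-0001-A1,HE
slide_02.svs,CASE-0002,CASE-0002-B1,PAS
"""
PRED = """filename,case_id,slide_id,stain
slide_01.svs,CASE-0001,CASE-0001-A1,HE
slide_02.svs,CASE-0002,CASE-0002-81,PAS
"""

scores = field_accuracy(TRUTH, PRED)
for field, acc in scores.items():
    print(f"{field}: {acc:.2f}")
```

Exact string matching is the strictest possible criterion; a real evaluation may want to normalize case or whitespace per field before comparing, for example for stain names.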

Provider:

heiDATA

Created:

2025-11-20
