five

SydneyMTL - Gastritis Sydney System Golden Dataset

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://data.mendeley.com/datasets/r6xmbkys6m
下载链接
链接失效反馈
官方服务:
资源简介:
Data Description: Consensus-Based Hibou-L Embeddings for Gastritis Research (Updated Sydney System) Overview & Objective: The Updated Sydney System (USS) for chronic gastritis often suffers from subjective diagnostic thresholds. This dataset, curated via pathologist consensus, provides a high-fidelity "Golden Dataset" to support the development of standardized, objective AI models for gastritis grading. Data Content: The original gastric biopsy specimens were collected from Seegene Medical Foundation, Seoul, South Korea (https://pr.seegenemedical.com/). This dataset contains 366 feature embeddings of the gastric biopsy WSI images extracted using the Hibou-L foundational model. It represents the morphological signatures of the gastric mucosa across five key attributes: H. pylori, Neutrophil activity, Mononuclear cells, Glandular atrophy, and Intestinal metaplasia. The samples were curated using a stratified joint-distribution strategy to ensure a balanced representation of all severity grades (0–3). Structure & Interpretation: 1. HDF5 Files (.h5): Each file contains features (1024-D embeddings), coordinates, and addresses for patches within a Whole Slide Image (WSI). 2. labels_and_prediction.csv: Provides ground truth consensus grades and the predictions from the SydneyMTL model. Note that atrophy=4 indicates cases where grading was not applicable due to the absence of the muscularis mucosae, reflecting real-world clinical constraints. For model predictions, the values are formatted as: "label(confidence)" Applications: This dataset is specifically designed to advance research and development in gastritis diagnostics. It can be utilized to: - Develop and validate automated gastritis grading systems based on the Sydney System. - Benchmark Multi-Instance Learning (MIL) models for multi-attribute classification in gastric pathology. - Analyze morphological feature representations of various gastritis phenotypes using foundational pathology encoders.
创建时间:
2026-03-02
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作