MEDISEG
收藏Figshare2025-03-14 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/MEDISEG/28574786
下载链接
链接失效反馈官方服务:
资源简介:
Dataset OverviewMEDISEG (MEDication Image SEGmentation) is a high-quality, real-world dataset designed for the development and evaluation of pill recognition models. It contains two subsets:MEDISEG (3-Pills): A controlled dataset featuring three pill types with subtle differences in shape and color.MEDISEG (32-Pills): A more diverse dataset containing 32 distinct pill classes, reflecting real-world challenges such as occlusions, varied lighting conditions, and multiple medications in a single frame.Each subset includes COCO-format annotations with instance segmentation masks, bounding boxes, and class labels.Dataset StructureThe dataset is organized as follows:MEDISEG/│── LICENSE│── metadata.csv│── 3pills/│ ├── annotations.json│ ├── images/│ │ ├── image1.jpg│ │ ├── image2.jpg│── 32pills/│ ├── annotations.json│ ├── images/│ │ ├── image1.jpg│ │ ├── image2.jpgLICENSE: The CC BY 4.0 license under which the dataset is distributed.metadata.csv: Supplementary drug information, including registration numbers, brand names, active ingredients, regulatory classifications, and official URLs.annotations.json: COCO-format annotation files providing segmentation masks, bounding boxes, and class labels.images/: High-resolution JPG images of medications.AcknowledgementsIf you use this dataset, please cite the corresponding publication:bibtex@article{MEDISEG2026,title = {A dataset of medication images with instance segmentation masks for preventing adverse drug events},author = {Chu, Wai Ip and Hirani, Shashi and Tarroni, Giacomo and Li, Ling},journal={arXiv preprint arXiv:2603.10825},year = {2026},doi={10.48550/arXiv.2603.10825}}
数据集概览
MEDISEG(药物图像分割,MEDication Image SEGmentation)是一款高质量的真实场景数据集,专为药物识别模型的研发与评估构建。该数据集包含两个子集:
MEDISEG(3-Pills):该受控数据集涵盖3种形状与色彩差异细微的药物类型。
MEDISEG(32-Pills):该数据集多样性更强,包含32种不同的药物类别,可反映真实场景中的各类挑战,如遮挡、光照条件多变及单帧内包含多种药物的场景。
每个子集均包含采用COCO格式的标注文件,涵盖实例分割掩码、边界框与类别标签。
数据集结构
该数据集的组织形式如下:
MEDISEG/
├── LICENSE
├── metadata.csv
├── 3pills/
│ ├── annotations.json
│ └── images/
│ ├── image1.jpg
│ └── image2.jpg
└── 32pills/
├── annotations.json
└── images/
├── image1.jpg
└── image2.jpg
各文件说明:
LICENSE:本数据集采用CC BY 4.0许可证进行分发。
metadata.csv:补充药物信息文件,包含药品注册编号、商品名、活性成分、监管分类以及官方网址。
annotations.json:采用COCO格式的标注文件,提供实例分割掩码、边界框与类别标签。
images/:存储药物的高分辨率JPG格式图像。
致谢
若您使用本数据集,请引用如下相关文献:
@article{MEDISEG2026,
title = "用于预防药物不良事件的带实例分割掩码的药物图像数据集",
author = "Chu, Wai Ip and Hirani, Shashi and Tarroni, Giacomo and Li, Ling",
journal="arXiv预印本 arXiv:2603.10825",
year = "2026",
doi="10.48550/arXiv.2603.10825"
}
创建时间:
2025-03-14



