five

OpenMed/synthvision-seeds

收藏
Hugging Face2026-03-23 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/OpenMed/synthvision-seeds
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 task_categories: - visual-question-answering tags: - medical - synthvision - openmed size_categories: - 100K<n<1M --- # synthvision-seeds ![SynthVision](synthvision_featured.png) Seed records from 4 open medical image datasets **Records**: 119,137 ## About Seed dataset for the [SynthVision pipeline](https://huggingface.co/blog/OpenMed/synthvision). Contains 119,137 records aggregated from 4 open medical image datasets: | Source | Records | Modality | |--------|---------|----------| | [eltorio/ROCO-radiology](https://huggingface.co/datasets/eltorio/ROCO-radiology) | 65,393 | Radiology | | [OpenMed/multicare-images](https://huggingface.co/datasets/OpenMed/multicare-images) | 50,000 | Mixed | | [flaviagiammarino/path-vqa](https://huggingface.co/datasets/flaviagiammarino/path-vqa) | 3,430 | Pathology | | [flaviagiammarino/vqa-rad](https://huggingface.co/datasets/flaviagiammarino/vqa-rad) | 314 | Radiology | Images are deduplicated by SHA-256 hash. Each record contains an image path, source dataset ID, modality, and any available metadata (captions or Q&A pairs). ## Schema ``` id: str # unique record ID image: str # relative image path source: str # source dataset name modality: str # imaging modality metadata: dict # captions, Q&A pairs, or labels ``` ## Loading ```python from datasets import load_dataset ds = load_dataset("OpenMed/synthvision-seeds") ``` ## Links - [SynthVision blog post](https://huggingface.co/blog/OpenMed/synthvision) - [Source code](https://github.com/openmed-labs/synthvision) - [All SynthVision artifacts](https://huggingface.co/collections/OpenMed/synthvision-69baac655b557943aa1babd3) - [OpenMed on Hugging Face](https://huggingface.co/OpenMed)
提供机构:
OpenMed
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作