vector-institute/open-pmc-18m
收藏Hugging Face2026-03-04 更新2025-07-05 收录
下载链接:
https://hf-mirror.com/datasets/vector-institute/open-pmc-18m
下载链接
链接失效反馈官方服务:
资源简介:
OPEN-PMC数据集是一个包含医学论文中图像-文本对的集合,旨在支持医学图像理解研究,包括医学图像标注、多模态学习、图像-文本检索和医学语言理解等任务。该数据集主要由英文文本组成,不包含预定义的数据划分,用户可以根据需要自行划分训练、验证和测试数据。数据集来源于BIOMEDICA,经过过滤和分解处理后,用于学术和研究目的。
The OPEN-PMC dataset is a collection of image-text pairs from medical papers, designed to support research in medical image understanding, including tasks such as medical image captioning, multimodal learning, image-text retrieval, and medical language understanding. The dataset primarily consists of English text, does not contain predefined splits, and is sourced from BIOMEDICA, filtered and decomposed for academic and research purposes.
提供机构:
vector-institute



