PMC-OA
收藏arXiv2025-09-30 收录
下载链接:
https://huggingface.co/datasets/axiong/pmc-oa
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个庞大的开源医疗图像描述数据集,包含了1.65百万对图像与描述的组合,这些数据来源于PubMedCentral的开放获取资源。其规模达到了1.65百万对,旨在为视觉-语言任务提供预训练支持。
This is a large-scale open-source medical image captioning dataset containing 1.65 million image-description pairs, which are sourced from open-access resources of PubMedCentral. With a total of 1.65 million pairs, this dataset aims to provide pre-training support for vision-language tasks.
提供机构:
PubMedCentral



