five

Bulamalaminu001/Happy

收藏
Hugging Face2026-01-02 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/Bulamalaminu001/Happy
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 task_categories: - question-answering - text-generation language: - en tags: - GPT-4V - Vision - medical - biology size_categories: - 1M<n<10M configs: - config_name: PubMedVision_Alignment_VQA data_files: PubMedVision_Alignment_VQA.json - config_name: PubMedVision_InstructionTuning_VQA data_files: PubMedVision_InstructionTuning_VQA.json - config_name: _Original_Caption data_files: PubMedVision_Original_Caption.json - config_name: _Chinese_Version data_files: PubMedVision_Chinese.json --- ## News - [2025/02/18]: We add the original captions of PubMedVision in `PubMedVision_Original_Caption.json`, as well as the Chinese version of PubMedVision in `PubMedVision_Chinese.json`. - [2024/07/01]: We add annotations for 'body_part' and 'modality' of images, utilizing the [HuatuoGPT-Vision-7B](https://huggingface.co/FreedomIntelligence/HuatuoGPT-Vision-7B) model. ## PubMedVision PubMedVision is a large-scale medical VQA dataset. We extracted high-quality image-text pairs from PubMed and used GPT-4V to reformat them to enhance their quality. PubMedVision significantly improves the multimodal capabilities of MLLMs in the medical field. For more details, refer to our [paper](https://arxiv.org/abs/2406.19280) and [github](https://github.com/FreedomIntelligence/HuatuoGPT-Vision). ## Data Volume PubMedVision contains 1.3 million medical VQAs, divided into Alignment VQA and Instruction Tuning VQA: | Data | # Data | | ---------- | ---------- | | PubMedVision_Alignment_VQA | 647,031 | | PubMedVision_InstructionTuning_VQA | 647,031 | | **Total** | **1,294,062** | ## Image Data `images_*.zip` contains the compressed image data. You can unzip these images using the following code: ```bash for ((i=0; i<20; i++)) do unzip -j images_$i.zip -d images/ & # wait patiently, it takes a while... done ``` ## Citation If you find our data useful, please consider citing our work! We are FreedomIntelligence from [Shenzhen Research Institute of Big Data](http://sribd.cn/en) and [The Chinese University of Hong Kong, Shenzhen](https://sds.cuhk.edu.cn/en) ``` @misc{chen2024huatuogptvisioninjectingmedicalvisual, title={HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale}, author={Junying Chen and Ruyi Ouyang and Anningzhe Gao and Shunian Chen and Guiming Hardy Chen and Xidong Wang and Ruifei Zhang and Zhenyang Cai and Ke Ji and Guangjun Yu and Xiang Wan and Benyou Wang}, year={2024}, eprint={2406.19280}, archivePrefix={arXiv}, primaryClass={cs.CV}, url={https://arxiv.org/abs/2406.19280}, } ```
提供机构:
Bulamalaminu001
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作