five

MedVH: Towards Systematic Evaluation of Hallucination for Large Vision Language Models in the Medical Context

收藏
DataCite Commons2025-12-10 更新2025-04-16 收录
下载链接:
https://physionet.org/content/medvh/
下载链接
链接失效反馈
官方服务:
资源简介:
Large Vision Language Models (LVLMs) have recently achieved superior performance in various tasks on natural image and text data, which inspires a large amount of studies for LVLMs fine-tuning and training. Despite their advancements, there has been scant research on the robustness of these models against hallucination when fine-tuned on smaller datasets. In this study, we introduce a new benchmark dataset, the Medical Visual Hallucination evaluation benchmark (MedVH), to evaluate the hallucination of domain-specific LVLMs. MedVH comprises six tasks to evaluate hallucinations in LVLMs within the medical context, which includes tasks for a comprehensive understanding of textual and visual input, as well as long textual response generation.
提供机构:
PhysioNet
创建时间:
2025-03-05
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作