Honest Health OOC Dataset

NIAID Data Ecosystem, indexed 2026-05-02
Download link:
https://zenodo.org/record/14670150
Description:
The rapid proliferation of health-related misinformation, particularly in multimodal formats combining text and images, poses significant risks to public health and trust in medical systems. Existing datasets and evaluation frameworks inadequately address the unique challenges of assessing honesty in multimodal large language models (MLLMs) within the health domain. In this paper, we introduce the Honest OOC Dataset, a specialized dataset designed to evaluate the honesty of MLLMs in out-of-context (OOC) scenarios. The dataset includes 8,016 real health-related image-caption pairs, with both similarity-based and LLM-generated falsified captions that closely mimic real-world misinformation patterns. To complement the dataset, we propose a comprehensive benchmark that assesses model honesty across three dimensions: (1) Truthful Representation, evaluating faithfulness to input information; (2) Honest Uncertainty, examining the model's ability to express knowledge limitations; and (3) Evidence Honesty, assessing the accuracy of judgments based on external evidence. We evaluate five different MLLMs using this benchmark, revealing the varying levels of honesty exhibited by the models from multiple perspectives. Our findings highlight the importance of accurately interpreting external evidence and adhering to reliable information, which are crucial for effective OOC detection in the health domain.
Created:
2025-01-20