MM-Hallu/HaELM
收藏Hugging Face2026-04-30 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/MM-Hallu/HaELM
下载链接
链接失效反馈官方服务:
资源简介:
HaELM数据集包含5,000个来自COCO val2014的图像-描述对,用于评估视觉语言模型(VLM)图像描述中的幻觉现象。每个图像有2-5个人工编写的参考描述和一个由MLLM(mPLUG-Owl)生成的描述,用于比较幻觉现象。数据集的特征包括图像、图像名称、参考描述、生成的描述和幻觉标签(yes表示准确,no表示幻觉,unknown表示未知)。数据集的评估指标包括描述幻觉率和准确率。数据来源于HaELM(arXiv 2023)。
HaELM dataset contains 5,000 image-caption pairs from COCO val2014 for evaluating hallucination in VLM image descriptions. Each image has 2-5 human-written reference captions and an MLLM (mPLUG-Owl) generated caption for hallucination comparison. The dataset features include image, image name, reference captions, generated caption, and hallucination label (yes for accurate, no for hallucinated, or unknown). Evaluation metrics include caption hallucination rate and accuracy. The data is sourced from HaELM (arXiv 2023).
提供机构:
MM-Hallu



