five

RVLab-CBNU/SALUS

收藏
Hugging Face2026-04-28 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/RVLab-CBNU/SALUS
下载链接
链接失效反馈
官方服务:
资源简介:
SALUS是一个多选视觉问答(VQA)数据集,旨在公平和稳健地评估视觉语言模型(VLMs)。与广泛使用的基于COCO数据集的VQA基准不同,SALUS是基于Docci数据集构建的,后者包含高度详细的人工编写的图像描述。通过GPT-4o的帮助生成10选1的多选VQA对,并经过严格的人工验证以确保质量和正确性。数据集还强调了减少预训练重叠和提高评估公平性的特点,以及通过结构化设计实现不确定性感知评估。数据集包含36,241个问答对(训练集23,905个,测试集12,336个),每个问题有10个选项,每个图像对应1到3个问答对。

SALUS is a multiple-choice Visual Question Answering (VQA) dataset designed to enable fair and robust evaluation of vision-language models (VLMs). Unlike widely used VQA benchmarks that rely on the COCO dataset, SALUS is constructed from the Docci Dataset, a dataset consisting of highly detailed, human-written image descriptions. Using these descriptions, we generate 10-choice multiple-choice VQA pairs with the assistance of GPT-4o, followed by strict human verification to ensure quality and correctness. The dataset emphasizes reducing pretraining overlap and improving evaluation fairness, while also enabling uncertainty-aware evaluation through its structured design. It contains 36,241 QA pairs (Train: 23,905, Test: 12,336), with 10 choices per question and 1~3 QA pairs per image.
提供机构:
RVLab-CBNU
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作