iWISDM
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/BashivanLab/iWISDM
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是在iWISDM环境中构建的,旨在生成各种复杂度的视觉-语言任务,以评估在多模态环境下遵循指令的能力。该数据集包含三个对应不同复杂度任务的基准,并已用于评估多个大型多模态模型与人类表现的对标。这些任务的复杂度分为低、中、高三个级别,总计包含150项试验,任务内容主要是关于多模态任务中的指令遵循。
This dataset was constructed in the iWISDM environment, aiming to generate visual-language tasks of varying complexities to evaluate instruction-following capabilities in multimodal settings. It contains three benchmarks corresponding to tasks of distinct complexity levels, and has been used to assess multiple large multimodal models against human performance. The complexity of these tasks is categorized into three tiers: low, medium, and high, totaling 150 trials. The tasks primarily focus on instruction following in multimodal scenarios.
提供机构:
BashivanLab



