"IRIS: Intent Resolution via Inference-time Saccades for Open-Ended VQA in Large Vision-Language Models Dataset"

Name: "IRIS: Intent Resolution via Inference-time Saccades for Open-Ended VQA in Large Vision-Language Models Dataset"
Creator: IEEE DataPort
Published: 2026-04-25 00:04:59
License: 暂无描述

DataCite Commons2026-04-25 更新2026-05-03 收录

下载链接：

https://ieee-dataport.org/documents/iris-intent-resolution-inference-time-saccades-open-ended-vqa-large-vision-language

下载链接

链接失效反馈

官方服务：

资源简介：

"This dataset accompanies IRIS (Intent Resolution via Inference-time Saccades) and contains eye-tracking and VQA trial data collected from a controlled user study of 500 unique image-question pairs spanning ambiguous and unambiguous visual queries. Each trial includes a full participant fixation sequence with normalized screen coordinates and timestamps, the participant's verbally posed question, a ground-truth answer, and responses from 11 state-of-the-art Vision-Language Models (VLMs) evaluated under three conditions: without eye data, with temporal filtering only, and with spatiotemporal filtering. Timing metadata captures the onset and offset of each participant's spoken question, enabling precise alignment of gaze behavior with verbal query initiation. The dataset is intended to support research on gaze-guided disambiguation in open-ended VQA and serves as a benchmark for evaluating the integration of real-time eye movement data into Large Vision-Language Models."

提供机构：

IEEE DataPort

创建时间：

2026-04-25

5,000+

优质数据集

54 个

任务类型

进入经典数据集