"IRIS: Intent Resolution via Inference-time Saccades for Open-Ended VQA in Large Vision-Language Models Dataset"
收藏DataCite Commons2026-04-25 更新2026-05-03 收录
下载链接:
https://ieee-dataport.org/documents/iris-intent-resolution-inference-time-saccades-open-ended-vqa-large-vision-language
下载链接
链接失效反馈官方服务:
资源简介:
"This dataset accompanies IRIS (Intent Resolution via Inference-time Saccades) and contains eye-tracking and VQA trial data collected from a controlled user study of 500 unique image-question pairs spanning ambiguous and unambiguous visual queries. Each trial includes a full participant fixation sequence with normalized screen coordinates and timestamps, the participant's verbally posed question, a ground-truth answer, and responses from 11 state-of-the-art Vision-Language Models (VLMs) evaluated under three conditions: without eye data, with temporal filtering only, and with spatiotemporal filtering. Timing metadata captures the onset and offset of each participant's spoken question, enabling precise alignment of gaze behavior with verbal query initiation. The dataset is intended to support research on gaze-guided disambiguation in open-ended VQA and serves as a benchmark for evaluating the integration of real-time eye movement data into Large Vision-Language Models."
提供机构:
IEEE DataPort
创建时间:
2026-04-25



