VisuLogic/VisuLogic-Train
Source: Hugging Face. Updated: 2025-06-28. Indexed: 2025-11-01.
Download link:
https://hf-mirror.com/datasets/VisuLogic/VisuLogic-Train
Description:
VisuLogic is a benchmark designed to evaluate the visual reasoning capabilities of multimodal large language models (MLLMs) independently of text-based reasoning. It comprises carefully constructed visual reasoning tasks spanning multiple categories, grouped into six types according to the reasoning skill required (for example, Quantitative Reasoning, which involves understanding and deducing changes in the number of elements in an image). Unlike existing benchmarks, its tasks are inherently difficult to express in language, so it provides a more rigorous evaluation of the visual reasoning capabilities of MLLMs.
Provider:
VisuLogic



