ServiceNow-AI/GRAFT_benchmark
收藏Hugging Face2025-09-23 更新2025-10-18 收录
下载链接:
https://hf-mirror.com/datasets/ServiceNow-AI/GRAFT_benchmark
下载链接
链接失效反馈官方服务:
资源简介:
GRAFT是一个多模态基准测试,旨在严格评估基础模型在视觉推理、结构化数据理解和指令遵循方面的能力。该基准测试包括多个配置,代表数据创建和评估的不同阶段。数据集包含了各种类型的图表和表格,以及与之相关的问答和元数据。
GRAFT is a multimodal benchmark designed to rigorously evaluate foundation models on tasks involving visual reasoning, structured data understanding, and instruction-following over synthetic yet realistic charts and tables. The benchmark includes multiple configurations representing different phases of data creation and evaluation.
提供机构:
ServiceNow-AI



