introvoyz041/EXAMS-V

Source: Hugging Face. Updated 2025-12-14; indexed 2025-12-20.
Download link:
https://hf-mirror.com/datasets/introvoyz041/EXAMS-V
Description:
EXAMS-V is a multilingual, multimodal dataset for evaluating and benchmarking the visual reasoning abilities of AI systems, especially Vision-Language Models (VLMs). It contains 24,856 multiple-choice questions (MCQs) collected from real school exams and other educational sources, and every question is presented as an image. The images include not just text but also tables, graphs, and mathematical content, which makes EXAMS-V a strong benchmark for testing how well models handle visual and structured information. The questions are written in 13 languages, including English, Arabic, Chinese, German, Bulgarian, Italian, Spanish, Urdu, Polish, Hungarian, Serbian, and Croatian, and they span multiple subject categories. Because the questions come from real school exams in different countries and education systems, the dataset combines region-specific knowledge, varied question formats, and multilingual content. Answering them takes more than reading: models must also understand the visual layout, interpret diagrams and symbols, and reason over both text and visuals.
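Since the questions are multiple-choice and span several languages, one natural way to report results on a benchmark like this is accuracy broken down by language. The following is a minimal sketch, not the dataset's official evaluation code. The field names `language`, `answer_key`, and `prediction` are hypothetical and may not match the actual dataset schema.

```python
from collections import defaultdict


def accuracy_by_language(records):
    """Return {language: fraction of records whose prediction matches the key}.

    Each record is a dict with the hypothetical fields 'language',
    'answer_key', and 'prediction' (assumed names, not the real schema).
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for rec in records:
        lang = rec["language"]
        total[lang] += 1
        # Compare option letters case-insensitively, ignoring stray whitespace.
        if rec["prediction"].strip().upper() == rec["answer_key"].strip().upper():
            correct[lang] += 1
    return {lang: correct[lang] / total[lang] for lang in total}


# Toy records for illustration only; these are not taken from the dataset.
sample = [
    {"language": "English", "answer_key": "A", "prediction": "A"},
    {"language": "English", "answer_key": "C", "prediction": "B"},
    {"language": "Polish", "answer_key": "D", "prediction": " d "},
]

print(accuracy_by_language(sample))
```

Reporting per-language scores rather than one pooled number keeps a weak result in a low-resource language from being hidden by strong results elsewhere.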
Provided by:
introvoyz041