introvoyz041/MathVista
Source: Hugging Face | Updated: 2025-12-14 | Indexed: 2025-12-20
Download link:
https://hf-mirror.com/datasets/introvoyz041/MathVista
Description:
MathVista is a consolidated benchmark for mathematical reasoning in visual contexts. It includes three newly created datasets, IQTest, FunctionQA, and PaperQA, which address missing visual domains and evaluate, respectively, logical reasoning on puzzle test figures, algebraic reasoning over function plots, and scientific reasoning with academic paper figures. It also incorporates 9 MathQA datasets and 19 VQA datasets from the literature, which broaden the diversity and complexity of the benchmark's visual perception and mathematical reasoning challenges. In total, MathVista contains 6,141 examples collected from 31 different datasets (3 new, 9 MathQA, and 19 VQA).
Provided by:
introvoyz041



