PEACE

Name: PEACE
Creator: maas
Published: 2025-12-05 12:12:29
License: 暂无描述

魔搭社区2025-12-05 更新2025-07-26 收录

下载链接：

https://modelscope.cn/datasets/microsoft/PEACE

下载链接

链接失效反馈

官方服务：

资源简介：

# PEACE: Empowering Geologic Map Holistic Understanding with MLLMs [[`Code`](https://github.com/microsoft/PEACE)] [[`Paper`](https://arxiv.org/pdf/2501.06184)] [[`Data`](https://huggingface.co/datasets/microsoft/PEACE)] <img src="./images/GeoMap-Bench_Pipeline.png" width="800"> ## Introduction We construct a geologic map benchmark, GeoMap-Bench, to evaluate the performance of MLLMs on geologic map understanding across different abilities, the overview of it is as shown in below Table. <table> <thead> <tr> <th style="text-align:left;" >Property</th> <th style="text-align:left;" >Description</th> </tr> </thead> <tbody> <tr> <th style="text-align:left;" rowspan="2">Source</th> <th style="text-align:left;" >USGS(English)</th> </tr> <tr> <th style="text-align:left;" >CGS(Chinese)</th> </tr> <tr> <th style="text-align:left;" >Content</th> <th style="text-align:left;" >Image-question pair with annotated answer</th> </tr> <tr> <th style="text-align:left;" >Scale</th> <th style="text-align:left;" >124 images and 3,864 questions</th> </tr> <tr> <th style="text-align:left;" >Resolution</th> <th style="text-align:left;" >6,1462 pixels on average</th> </tr> <tr> <th style="text-align:left;" rowspan="3">Question Type</th> <th style="text-align:left;" >1.Multiple-choicequestion</th> </tr> <tr> <th style="text-align:left;" >2.Fill-in-the-blankquestion</th> </tr> <tr> <th style="text-align:left;" >3.Essayquestion</th> </tr> <tr> <td style="text-align:left;" rowspan="5">Covering Ability</td> <td style="text-align:left;" >1.Extracting</td> </tr> <tr> <td style="text-align:left;" >2.Grounding</td> </tr> <tr> <td style="text-align:left;" >3.Referring</td> </tr> <tr> <td style="text-align:left;" >4.Reasoning</td> </tr> <tr> <td style="text-align:left;" >5.Analyzing</td> </tr> <tr> <td style="text-align:left;" >Defined Task</td> <td style="text-align:left;" >25 tasks</td> </tr> </tbody> </table> ## Data Instance ```Python { "question: "According to this geologic map, regarding the rock type in main map, which one has the smallest area among 4 choices?", "answer": "D", "type": "reasoning-area_comparison", "A": "Torok Formation", "B": "Surficial deposits, undivided (Holocene and Pleistocene)", "C": "Lower part", "D": "Alluvial deposits, undivided (Holocene and Pleistocene)", "mcq": true, "img_path": "16809_83756_4.jpg" } ``` ## Data Fields - `question`: The question - `answer`: The annotated answer - `type`: The question type - `A`: Choice A - `B`: Choice B - `C`: Choice C - `D`: Choice D - `mcq`: Whether the question is multiple-choice question - `img_path`: The image path of geologic map ## Data Distribution The distribution of evaluation abilities and tasks is demonstrated below. <img src="./images/GeoMap-Bench_Distribution.png" width="600"> ## Leaderboard Through comprehensive experiments, GeoMap-Agent achieves an overall score of 0.811 on GeoMap-Bench, significantly outperforming 0.369 of GPT-4o. | Method | Extracting | Grounding | Referring | Reasoning | Analyzing | Overall | |----------------------|------------|-----------|-----------|-----------|-----------|----------| | Random | 0 | 0 | 0.250 | 0.250 | 0 | 0.100 | | GPT-4o | 0.219 | 0.128 | 0.378 | 0.507 | 0.612 | 0.369 | | GeoMap-Agent | **0.832** | **0.920** | **0.886** | **0.588** | **0.831** | **0.811** | ## Citation ``` TBD ``` ## License The dataset is licensed under the MIT License.

# PEACE：依托多模态大语言模型（MLLMs）实现地质图全维度理解 [[`代码`](https://github.com/microsoft/PEACE)] [[`论文`](https://arxiv.org/pdf/2501.06184)] [[`数据集`](https://huggingface.co/datasets/microsoft/PEACE)] <img src="./images/GeoMap-Bench_Pipeline.png" width="800"> ## 引言本研究构建了地质图基准数据集GeoMap-Bench，用于评估多模态大语言模型（MLLMs）在不同能力维度下的地质图理解性能，其整体概况如下表所示。 <table> <thead> <tr> <th style="text-align:left;" >属性</th> <th style="text-align:left;" >描述</th> </tr> </thead> <tbody> <tr> <th style="text-align:left;" rowspan="2">数据来源</th> <th style="text-align:left;" >美国地质调查局（USGS，英文源）</th> </tr> <tr> <th style="text-align:left;" >中国地质调查局（CGS，中文源）</th> </tr> <tr> <th style="text-align:left;" >数据内容</th> <th style="text-align:left;" >带标注答案的图像-问题对</th> </tr> <tr> <th style="text-align:left;" >数据规模</th> <th style="text-align:left;" >124张地质图图像与3864个问题</th> </tr> <tr> <th style="text-align:left;" >图像分辨率</th> <th style="text-align:left;" >平均分辨率为6146²像素</th> </tr> <tr> <th style="text-align:left;" rowspan="3">问题类型</th> <th style="text-align:left;" >1. 选择题</th> </tr> <tr> <th style="text-align:left;" >2. 填空题</th> </tr> <tr> <th style="text-align:left;" >3. 简答题</th> </tr> <tr> <td style="text-align:left;" rowspan="5">覆盖能力维度</td> <td style="text-align:left;" >1. 信息提取</td> </tr> <tr> <td style="text-align:left;" >2. 实体定位</td> </tr> <tr> <td style="text-align:left;" >3. 指代理解</td> </tr> <tr> <td style="text-align:left;" >4. 逻辑推理</td> </tr> <tr> <td style="text-align:left;" >5. 综合分析</td> </tr> <tr> <td style="text-align:left;" >预设任务数</td> <td style="text-align:left;" >共25项任务</td> </tr> </tbody> </table> ## 数据实例 Python { "question": "根据该地质图，针对主图中的岩石类型，4个选项中面积最小的是哪一个？", "answer": "D", "type": "reasoning-area_comparison", "A": "托罗克组（Torok Formation）", "B": "未划分表层沉积（全新世与更新世）", "C": "下部岩层", "D": "未划分冲积沉积（全新世与更新世）", "mcq": true, "img_path": "16809_83756_4.jpg" } ## 数据字段说明 - `question`：问题文本 - `answer`：标注的标准答案 - `type`：问题类型 - `A`：选项A - `B`：选项B - `C`：选项C - `D`：选项D - `mcq`：是否为选择题 - `img_path`：对应地质图的图像路径 ## 数据分布本基准的评估能力与任务分布如下图所示。 <img src="./images/GeoMap-Bench_Distribution.png" width="600"> ## 排行榜通过全面的对比实验，GeoMap-Agent在GeoMap-Bench上的总得分为0.811，显著优于GPT-4o的0.369。 | 模型方法 | 信息提取 | 实体定位 | 指代理解 | 逻辑推理 | 综合分析 | 综合得分 | |----------------|----------|----------|----------|----------|----------|----------| | 随机基准模型 | 0 | 0 | 0.250 | 0.250 | 0 | 0.100 | | GPT-4o | 0.219 | 0.128 | 0.378 | 0.507 | 0.612 | 0.369 | | GeoMap-Agent | **0.832**| **0.920**| **0.886**| **0.588**| **0.831**| **0.811**| ## 引用待补充 ## 许可证本数据集采用MIT许可证进行授权。

提供机构：

maas

创建时间：

2025-07-22

5,000+

优质数据集

54 个

任务类型

进入经典数据集