MapTrace
Source: ModelScope community. Updated 2026-01-06. Indexed 2025-11-22.
Download link:
https://modelscope.cn/datasets/google/MapTrace
Resource description:
# MapTrace: A 2M-Sample Synthetic Dataset for Path Tracing on Maps
## Dataset Format
The dataset contains 2M annotated paths designed to train models on route-tracing tasks.
Splits:
- `maptrace_parquet`: Contains paths on more complex, stylized maps such as those found in brochures, park directories or shopping malls.
- `floormap_parquet`: Contains paths on simpler, structured floor maps, typical of office buildings, apartment complexes, or campus maps.
Each of these splits has the following fields:
- `image_bytes`: The raw bytes of the generated map image (without post-processing).
- `label_text`: A string representation of a list of coordinates defining the target path. All coordinates are normalized between 0 and 1.
- `input_text`: A natural language question (prompt) asking the model to find the path specified in `label_text`.
- `map_description`: A natural language description of the map image, used by a text-to-image generation model to create the synthetic image.
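Because `label_text` stores the path as a string, it has to be parsed before use. The following is a minimal sketch, assuming the string is a Python-style literal list of `(x, y)` pairs; the exact serialization and the axis order are assumptions, not confirmed by this card. The helper names `parse_path` and `to_pixels` and the sample label are hypothetical.

```python
import ast


def parse_path(label_text):
    """Parse a stringified list of coordinate pairs into float tuples.

    Assumes a Python-literal format such as "[(0.1, 0.2), (0.3, 0.4)]".
    """
    points = ast.literal_eval(label_text)
    return [(float(x), float(y)) for x, y in points]


def to_pixels(path, width, height):
    """Scale normalized coordinates to pixel coordinates.

    Assumes the first value is the horizontal axis and the second the vertical.
    """
    return [(round(x * width), round(y * height)) for x, y in path]


# Hypothetical label, made up for illustration (not taken from the dataset).
example_label = "[(0.10, 0.25), (0.50, 0.25), (0.50, 0.75)]"
path = parse_path(example_label)
pixels = to_pixels(path, width=1000, height=800)
```

`ast.literal_eval` is used instead of `eval` so that only literals are accepted. If the actual files use a different serialization, such as JSON, the parsing step would need to change accordingly.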
We also release the splits used in our paper in `maptrace_20k`. The schema in these files is as follows:
- `image`: The image bytes of the map, *annotated* with start and end positions.
- `label`: A string representation of a list of coordinates defining the target path. All coordinates are normalized between 0 and 1.
- `input`: A natural language question (prompt) asking the model to find the path specified in `label`.
If you use our work, please cite:
```bibtex
@misc{panagopoulou2025maptracescalabledatageneration,
title={MapTrace: Scalable Data Generation for Route Tracing on Maps},
author={Artemis Panagopoulou and Aveek Purohit and Achin Kulshrestha and Soroosh Yazdani and Mohit Goyal},
year={2025},
eprint={2512.19609},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2512.19609},
}
```
Provider:
maas
Created:
2025-11-14



