five

MapTrace

收藏
魔搭社区2026-01-06 更新2025-11-22 收录
下载链接:
https://modelscope.cn/datasets/google/MapTrace
下载链接
链接失效反馈
官方服务:
资源简介:
# MapTrace: A 2M-Sample Synthetic Dataset for Path Tracing on Maps ## Dataset Format The dataset contains 2M annotated paths designed to train models on route-tracing tasks. Splits: - `maptrace_parquet`: Contains paths on more complex, stylized maps such as those found in brochures, park directories or shopping malls. - `floormap_parquet`: Contains paths on simpler, structured floor maps, typical of office buildings appartment complexes, or campus maps. Each of these splits has the following fields: - `image_bytes`: The raw bytes of the generated map image (without post processing.) - `label_text`: A string representation of a list of coordinates defining the target path. All coordinates are normalized between 0 and 1. - `input_text`: A natural language question (prompt) asking the model to find the path specified in `label_text`. - `map_description`: A natural language description of the map image, used by a text-to-image generation model to create the synthetic image. We also release the splits used in our paper in `maptrace_20k`. The schema in these files is as follows: - `image`: The image bytes of the map, *annotated* with start and end positions - `label`: A string representation of a list of coordinates defining the target path. All coordinates are normalized between 0 and 1. - `input`: A natural language question (prompt) asking the model to find the path specified in `label`. If you use our work, please cite ```bibtex @misc{panagopoulou2025maptracescalabledatageneration, title={MapTrace: Scalable Data Generation for Route Tracing on Maps}, author={Artemis Panagopoulou and Aveek Purohit and Achin Kulshrestha and Soroosh Yazdani and Mohit Goyal}, year={2025}, eprint={2512.19609}, archivePrefix={arXiv}, primaryClass={cs.CV}, url={https://arxiv.org/abs/2512.19609}, } ```

# MapTrace数据集:面向地图路径追踪任务的200万样本合成数据集 ## 数据集格式 本数据集包含200万条带标注路径,旨在训练可完成路径追踪任务的模型。 数据集划分如下: - `maptrace_parquet`:包含来自更复杂的风格化地图的路径,例如宣传册、公园导览图或商场地图中的路径。 - `floormap_parquet`:包含来自更简洁规整的平面图的路径,典型应用场景包括办公楼、公寓园区或校园地图。 两类划分数据集均包含以下字段: - `image_bytes`:生成的原始地图图像字节流(未经过后期处理)。 - `label_text`:定义目标路径的坐标列表的字符串表示形式,所有坐标均归一化至0至1区间内。 - `input_text`:用于提示模型找出`label_text`中指定路径的自然语言问句。 - `map_description`:供文本生成图像模型创建合成地图图像的自然语言地图描述文本。 我们还发布了论文中使用的`maptrace_20k`划分子集,该类文件的字段结构如下: - `image`:带有起点和终点标注的地图图像字节流。 - `label`:定义目标路径的坐标列表的字符串表示形式,所有坐标均归一化至0至1区间内。 - `input`:用于提示模型找出`label`中指定路径的自然语言问句。 若您使用本研究成果,请引用如下BibTeX条目: bibtex @misc{panagopoulou2025maptracescalabledatageneration, title={MapTrace: Scalable Data Generation for Route Tracing on Maps}, author={Artemis Panagopoulou and Aveek Purohit and Achin Kulshrestha and Soroosh Yazdani and Mohit Goyal}, year={2025}, eprint={2512.19609}, archivePrefix={arXiv}, primaryClass={cs.CV}, url={https://arxiv.org/abs/2512.19609}, }
提供机构:
maas
创建时间:
2025-11-14
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作