easyr1-10k-hard-qwen7b-easy-gta1-4MP-dedup-pc-e-by-image-path

Name: easyr1-10k-hard-qwen7b-easy-gta1-4MP-dedup-pc-e-by-image-path
Creator: maas
Published: 2025-12-05 11:39:39
License: 暂无描述

魔搭社区2025-12-05 更新2025-12-06 收录

下载链接：

https://modelscope.cn/datasets/mlfoundations-cua-dev/easyr1-10k-hard-qwen7b-easy-gta1-4MP-dedup-pc-e-by-image-path

下载链接

链接失效反馈

官方服务：

资源简介：

# easyr1-10k-hard-qwen7b-easy-gta1-4MP-dedup-pc-e-by-image-path This dataset was generated using the EasyR1 grounding dataset pipeline. ## Generation Details - **Generated on**: 2025-08-24 17:16:50 UTC - **Script**: `push_easyr1_to_hf.py` - **Data directory**: `/lustre/fsw/portfolios/nvr/users/aawadalla/LLaMA-Factory/data` ## Parameters Used - **Maximum samples**: 10000 - **Image resize (max megapixels)**: 4.0 MP - **Minimum native image resolution**: 0.0 MP - **Prompt format**: `gta1_with_resolution` - **Output format**: `coordinates` - **Random seed**: 42 - **Resampling enabled**: False - **Icon upsampling ratio**: Disabled (random sampling) - **pc-agent-e deduplication**: True ## Dataset Groups The following JSON/JSONL files were used to create this dataset: ### Dataset Group 1 Files (intersection of kept samples across all files): - grounding-data-filters/pixmo-points-qwen_tool_call-not_grounded-Qwen_Qwen2.5-VL-7B-Instruct-qwen_tool_call.jsonl - grounding-data-filters/pixmo-points-gta1-correctly_grounded-HelloKKMe_GTA1-7B-gta1.jsonl ### Dataset Group 2 Files (intersection of kept samples across all files): - grounding-data-filters/autogui-grounding-only-qwen_tool_call-not_grounded-Qwen_Qwen2.5-VL-7B-Instruct-qwen_tool_call.jsonl - grounding-data-filters/autogui-grounding-only-gta1-correctly_grounded-HelloKKMe_GTA1-7B-gta1.jsonl ### Dataset Group 3 Files (intersection of kept samples across all files): - grounding-data-filters/seeclick-qwen_tool_call-not_grounded-Qwen_Qwen2.5-VL-7B-Instruct-qwen_tool_call.jsonl - grounding-data-filters/seeclick-gta1-correctly_grounded-HelloKKMe_GTA1-7B-gta1.jsonl ### Dataset Group 4 Files (intersection of kept samples across all files): - grounding-data-filters/pc-e-grounding-only-claude-instructions-qwen_tool_call-not_grounded-Qwen_Qwen2.5-VL-7B-Instruct-qwen_tool_call.jsonl - grounding-data-filters/pc-e-grounding-only-claude-instructions-gta1-correctly_grounded-HelloKKMe_GTA1-7B-gta1.jsonl ### Dataset Group 5 Files (intersection of kept samples across all files): - grounding-data-filters/omniact-grounding-only-qwen_tool_call-not_grounded-Qwen_Qwen2.5-VL-7B-Instruct-qwen_tool_call.jsonl - grounding-data-filters/omniact-grounding-only-gta1-correctly_grounded-HelloKKMe_GTA1-7B-gta1.jsonl ### Dataset Group 6 Files (intersection of kept samples across all files): - grounding-data-filters/showui-desktop-qwen_tool_call-not_grounded-Qwen_Qwen2.5-VL-7B-Instruct-qwen_tool_call.jsonl - grounding-data-filters/showui-desktop-gta1-correctly_grounded-HelloKKMe_GTA1-7B-gta1.jsonl ### Dataset Group 7 Files (intersection of kept samples across all files): - grounding-data-filters/showui-web-qwen_tool_call-not_grounded-Qwen_Qwen2.5-VL-7B-Instruct-qwen_tool_call.jsonl - grounding-data-filters/showui-web-gta1-correctly_grounded-HelloKKMe_GTA1-7B-gta1.jsonl ### Dataset Group 8 Files (intersection of kept samples across all files): - grounding-data-filters/uground-qwen_tool_call-not_grounded-Qwen_Qwen2.5-VL-7B-Instruct-qwen_tool_call.jsonl - grounding-data-filters/uground-gta1-correctly_grounded-HelloKKMe_GTA1-7B-gta1.jsonl ### Dataset Group 9 Files (intersection of kept samples across all files): - grounding-data-filters/waveui-qwen_tool_call-not_grounded-Qwen_Qwen2.5-VL-7B-Instruct-qwen_tool_call.jsonl - grounding-data-filters/waveui-gta1-correctly_grounded-HelloKKMe_GTA1-7B-gta1.jsonl ## Dataset Statistics - **Total training samples**: 10000 - **Image dimensions**: Variable - **Columns**: image_path, prompt, normalized_bbox, images, easyr1_prompt, bbox, messages, image_width, image_height ## System Prompt The following system prompt is used for this dataset: ``` You are an expert UI element locator. Given a GUI image and a user's element description, provide the coordinates of the specified element as a single (x,y) point. The image resolution is height 2048 and width 2048. For elements with area, return the center point. Output the coordinate pair exactly: (x,y) ``` ## Sample Entry - **User prompt**: <image> Tap on badges, in the middle of the page - **Assistant response**: (666,629) - **Bounding box**: [208, 579, 1125, 680] - **Image path**: uground-images/uground_034465.jpg ## Usage ```python from datasets import load_dataset dataset = load_dataset("mlfoundations-cua-dev/easyr1-10k-hard-qwen7b-easy-gta1-4MP-dedup-pc-e-by-image-path") # Access the training data train_data = dataset['train'] # Example: Get the first sample sample = train_data[0] images = sample['images'] messages = sample['messages'] bbox = sample['bbox'] ``` ## Prompt Formats ### gta1_with_resolution GTA1 format with image resolution included in the system prompt. Outputs coordinates in (x,y) format. ## License Please refer to the original dataset licenses for usage restrictions.

# easyr1-10k-hard-qwen7b-easy-gta1-4MP-dedup-pc-e-by-image-path 本数据集基于EasyR1视觉锚定数据集流水线生成。 ## 生成详情 - **生成时间**：2025-08-24 17:16:50 UTC - **生成脚本**：`push_easyr1_to_hf.py` - **数据目录**：`/lustre/fsw/portfolios/nvr/users/aawadalla/LLaMA-Factory/data` ## 所用参数 - **最大样本数**：10000 - **图像最大缩放分辨率**：4.0 百万像素（MP） - **最小原生图像分辨率**：0.0 MP - **提示格式**：`gta1_with_resolution` - **输出格式**：坐标（coordinates） - **随机种子**：42 - **重采样功能**：已禁用 - **图标上采样比例**：已禁用（采用随机采样） - **pc-agent-e 去重**：已启用 ## 数据集组以下JSON/JSONL文件用于构建本数据集： ### 数据集组1 文件（所有文件中保留样本的交集）： - `grounding-data-filters/pixmo-points-qwen_tool_call-not_grounded-Qwen_Qwen2.5-VL-7B-Instruct-qwen_tool_call.jsonl` - `grounding-data-filters/pixmo-points-gta1-correctly_grounded-HelloKKMe_GTA1-7B-gta1.jsonl` ### 数据集组2 文件（所有文件中保留样本的交集）： - `grounding-data-filters/autogui-grounding-only-qwen_tool_call-not_grounded-Qwen_Qwen2.5-VL-7B-Instruct-qwen_tool_call.jsonl` - `grounding-data-filters/autogui-grounding-only-gta1-correctly_grounded-HelloKKMe_GTA1-7B-gta1.jsonl` ### 数据集组3 文件（所有文件中保留样本的交集）： - `grounding-data-filters/seeclick-qwen_tool_call-not_grounded-Qwen_Qwen2.5-VL-7B-Instruct-qwen_tool_call.jsonl` - `grounding-data-filters/seeclick-gta1-correctly_grounded-HelloKKMe_GTA1-7B-gta1.jsonl` ### 数据集组4 文件（所有文件中保留样本的交集）： - `grounding-data-filters/pc-e-grounding-only-claude-instructions-qwen_tool_call-not_grounded-Qwen_Qwen2.5-VL-7B-Instruct-qwen_tool_call.jsonl` - `grounding-data-filters/pc-e-grounding-only-claude-instructions-gta1-correctly_grounded-HelloKKMe_GTA1-7B-gta1.jsonl` ### 数据集组5 文件（所有文件中保留样本的交集）： - `grounding-data-filters/omniact-grounding-only-qwen_tool_call-not_grounded-Qwen_Qwen2.5-VL-7B-Instruct-qwen_tool_call.jsonl` - `grounding-data-filters/omniact-grounding-only-gta1-correctly_grounded-HelloKKMe_GTA1-7B-gta1.jsonl` ### 数据集组6 文件（所有文件中保留样本的交集）： - `grounding-data-filters/showui-desktop-qwen_tool_call-not_grounded-Qwen_Qwen2.5-VL-7B-Instruct-qwen_tool_call.jsonl` - `grounding-data-filters/showui-desktop-gta1-correctly_grounded-HelloKKMe_GTA1-7B-gta1.jsonl` ### 数据集组7 文件（所有文件中保留样本的交集）： - `grounding-data-filters/showui-web-qwen_tool_call-not_grounded-Qwen_Qwen2.5-VL-7B-Instruct-qwen_tool_call.jsonl` - `grounding-data-filters/showui-web-gta1-correctly_grounded-HelloKKMe_GTA1-7B-gta1.jsonl` ### 数据集组8 文件（所有文件中保留样本的交集）： - `grounding-data-filters/uground-qwen_tool_call-not_grounded-Qwen_Qwen2.5-VL-7B-Instruct-qwen_tool_call.jsonl` - `grounding-data-filters/uground-gta1-correctly_grounded-HelloKKMe_GTA1-7B-gta1.jsonl` ### 数据集组9 文件（所有文件中保留样本的交集）： - `grounding-data-filters/waveui-qwen_tool_call-not_grounded-Qwen_Qwen2.5-VL-7B-Instruct-qwen_tool_call.jsonl` - `grounding-data-filters/waveui-gta1-correctly_grounded-HelloKKMe_GTA1-7B-gta1.jsonl` ## 数据集统计信息 - **总训练样本数**：10000 - **图像尺寸**：可变 - **数据列**：image_path（图像路径）、prompt（提示词）、normalized_bbox（归一化边界框，normalized bounding box）、images（图像数据）、easyr1_prompt、bbox（边界框，bounding box）、messages（消息序列）、image_width（图像宽度）、image_height（图像高度） ## 系统提示词你是一名专业的UI元素定位器（UI element locator）。给定一张GUI（图形用户界面，Graphical User Interface）图像与用户的元素描述，请以单个(x,y)坐标点的形式给出指定元素的位置。图像分辨率为高度2048、宽度2048。对于存在面积的元素，请返回其中心点坐标。请严格按照以下格式输出坐标对： (x,y) ## 样本示例 - **用户提示词**：<image> 点击页面中间的徽章 - **助手回复**：(666,629) - **边界框**：[208, 579, 1125, 680] - **图像路径**：uground-images/uground_034465.jpg ## 使用方法 python from datasets import load_dataset # 加载本数据集 dataset = load_dataset("mlfoundations-cua-dev/easyr1-10k-hard-qwen7b-easy-gta1-4MP-dedup-pc-e-by-image-path") # 访问训练数据 train_data = dataset['train'] # 示例：获取第一个样本 sample = train_data[0] images = sample['images'] messages = sample['messages'] bbox = sample['bbox'] ## 提示格式 ### gta1_with_resolution 包含图像分辨率的GTA1提示格式，输出为(x,y)格式的坐标。 ## 许可证请参考原始数据集的使用限制条款。

提供机构：

maas

创建时间：

2025-10-12

5,000+

优质数据集

54 个

任务类型

进入经典数据集