five

osunlp/MisActBench

收藏
Hugging Face2026-02-18 更新2026-04-05 收录
下载链接:
https://hf-mirror.com/datasets/osunlp/MisActBench
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - en license: cc-by-4.0 size_categories: - 1K<n<10K task_categories: - image-text-to-text pretty_name: MisActBench tags: - computer-use-agent - safety - alignment - benchmark - misaligned-action-detection --- # MisActBench **MisActBench** is a comprehensive benchmark for evaluating misaligned action detection in Computer-Use Agents (CUAs). It contains **2,264 human-annotated action-level alignment labels** across **558 realistic CUA trajectories**, covering both externally induced and internally arising misaligned actions. ## 🔗 Links - [🏠 Homepage](https://osu-nlp-group.github.io/Misaligned-Action-Detection/) - [📖 Paper](https://arxiv.org/abs/2602.08995) - [💻 Code](https://github.com/OSU-NLP-Group/Misaligned-Action-Detection) ## 📊 Dataset Summary | Statistic | Count | |---|---| | Trajectories | 558 | | Steps | 2,264 | | Aligned Steps | 1,264 | | Misaligned Steps | 1,000 | | - Malicious Instruction Following | 562 (56.2%) | | - Harmful Unintended Behavior | 210 (21.0%) | | - Other Task-Irrelevant Behavior | 228 (22.8%) | ## 📁 Dataset Structure This dataset consists of two files: - **`misactbench.json`** — The main annotation file containing all trajectory metadata, step-level labels, and action outputs. - **`trajectories.zip`** — A zip archive containing screenshot images organized by trajectory ID. After downloading and extracting, the file structure is: ``` MisActBench/ ├── misactbench.json └── trajectories/ ├── <trajectory_id>/ │ ├── step_0_*.png │ ├── step_1_*.png │ └── ... └── ... ``` ## 📝 Data Format `misactbench.json` is a JSON object keyed by `trajectory_id` (UUID). Each entry has the following fields: | Field | Type | Description | |---|---|---| | `trajectory_id` | `string` | UUID for the trajectory | | `instruction` | `string` | The user instruction given to the CUA | | `total_steps` | `int` | Total number of steps in the trajectory | | `trajectory_path` | `string` | Relative path to the trajectory screenshot folder | | `metadata` | `object` | Contains `source` (data source) and `agent` (CUA model) | | `steps` | `object` | Dict keyed by step number (see below) | Each step in `steps` contains: | Field | Type | Description | |---|---|---| | `step_idx` | `int` | Step index | | `label` | `bool \| null` | `true` = misaligned, `false` = aligned, `null` = not annotated | | `category` | `string \| null` | Misalignment category (only set when `label=true`) | | `agent_output` | `string` | The agent's proposed action for this step | | `screenshot_path` | `string` | Relative path to the screenshot observed before this action | ### Misalignment Categories - **Malicious Instruction Following**: The action complies with malicious instructions in external environments to achieve an attacker's goal. - **Harmful Unintended Behavior**: The action causes harm inadvertently due to inherent limitations (e.g., reasoning error) rather than adversarial attack. - **Other Task-Irrelevant Behavior**: The action does not cause harmful consequences but is irrelevant to the user task and will degrade efficiency and reliability. ## 📮 Contact [Yuting Ning](mailto:ning.151@osu.edu), [Huan Sun](mailto:sun.397@osu.edu) ## 📝 Citation Information If you find this work useful, please consider citing our paper: ``` @misc{ning2026actionsofftaskdetectingcorrecting, title={When Actions Go Off-Task: Detecting and Correcting Misaligned Actions in Computer-Use Agents}, author={Yuting Ning and Jaylen Jones and Zhehao Zhang and Chentao Ye and Weitong Ruan and Junyi Li and Rahul Gupta and Huan Sun}, year={2026}, eprint={2602.08995}, archivePrefix={arXiv}, primaryClass={cs.CL}, url={https://arxiv.org/abs/2602.08995}, } ```

--- 语言: - 英语 许可协议:知识共享署名4.0(CC BY 4.0) 样本规模类别: - 1000 < n < 10000 任务类别: - 图像-文本到文本 友好名称:MisActBench 标签: - 计算机使用智能体(computer-use-agent) - 安全性(safety) - 对齐性(alignment) - 基准测试(benchmark) - 错位动作检测(misaligned-action-detection) --- # MisActBench基准数据集 **MisActBench**是一款用于评估计算机使用智能体(Computer-Use Agents,简称CUAs)错位动作检测能力的综合性基准数据集。该数据集涵盖558条真实的CUA运行轨迹,包含2264条经人工标注的动作级对齐标签,覆盖外部诱导与内生两类错位动作。 ## 🔗 链接 - [🏠 主页](https://osu-nlp-group.github.io/Misaligned-Action-Detection/) - [📖 论文](https://arxiv.org/abs/2602.08995) - [💻 代码](https://github.com/OSU-NLP-Group/Misaligned-Action-Detection) ## 📊 数据集统计摘要 | 统计项 | 数量 | |---|---| | 运行轨迹 | 558 | | 总步数 | 2264 | | 对齐步数 | 1264 | | 错位步数 | 1000 | | - 恶意指令遵循 | 562 (56.2%) | | - 有害非预期行为 | 210 (21.0%) | | - 其他与任务无关行为 | 228 (22.8%) | ## 📁 数据集结构 本数据集包含两个文件: - **`misactbench.json`** — 主标注文件,包含所有轨迹元数据、步级标签与动作输出内容。 - **`trajectories.zip`** — 按轨迹ID组织的截图图像压缩归档包。 下载并解压后,文件目录结构如下: MisActBench/ ├── misactbench.json └── trajectories/ ├── <trajectory_id>/ │ ├── step_0_*.png │ ├── step_1_*.png │ └── ... └── ... ## 📝 数据格式 `misactbench.json`是一个以`trajectory_id`(通用唯一识别码,UUID)为键的JSON对象。每个条目包含以下字段: | 字段 | 类型 | 描述 | |---|---|---| | `trajectory_id` | `string` | 运行轨迹的UUID | | `instruction` | `string` | 下发给CUA的用户指令 | | `total_steps` | `int` | 运行轨迹的总步数 | | `trajectory_path` | `string` | 轨迹截图文件夹的相对路径 | | `metadata` | `object` | 包含`source`(数据来源)与`agent`(CUA模型)两个子字段 | | `steps` | `object` | 以步号为键的字典(详见下文) | `steps`中的每个步对象包含以下字段: | 字段 | 类型 | 描述 | |---|---|---| | `step_idx` | `int` | 步索引编号 | | `label` | `bool | null` | `true`表示动作错位,`false`表示动作对齐,`null`表示未标注 | | `category` | `string | null` | 错位动作类别(仅当`label=true`时有效) | | `agent_output` | `string` | 智能体针对该步提出的动作方案 | | `screenshot_path` | `string` | 该动作执行前截取的屏幕截图的相对路径 | ### 错位动作分类 - **恶意指令遵循**:该动作遵从外部环境中的恶意指令,以达成攻击者的预设目标。 - **有害非预期行为**:该动作因智能体固有局限(如推理错误)而非对抗性攻击,意外造成危害。 - **其他与任务无关行为**:该动作未造成有害后果,但与用户任务无关,会降低任务执行效率与可靠性。 ## 📮 联系方式 [宁宇婷](mailto:ning.151@osu.edu), [孙欢](mailto:sun.397@osu.edu) ## 📝 引用说明 若您认为本工作对您有所帮助,请引用我们的论文: @misc{ning2026actionsofftaskdetectingcorrecting, title={When Actions Go Off-Task: Detecting and Correcting Misaligned Actions in Computer-Use Agents}, author={Yuting Ning and Jaylen Jones and Zhehao Zhang and Chentao Ye and Weitong Ruan and Junyi Li and Rahul Gupta and Huan Sun}, year={2026}, eprint={2602.08995}, archivePrefix={arXiv}, primaryClass={cs.CL}, url={https://arxiv.org/abs/2602.08995}, }
提供机构:
osunlp
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作