CaiZhiTech/Evaluation-Dataset-of-AI-Agent-Security-Guardrails

Name: CaiZhiTech/Evaluation-Dataset-of-AI-Agent-Security-Guardrails
Creator: CaiZhiTech
Published: 2026-04-29 01:38:50
License: 暂无描述

Hugging Face2026-04-29 更新2026-05-03 收录

下载链接：

https://hf-mirror.com/datasets/CaiZhiTech/Evaluation-Dataset-of-AI-Agent-Security-Guardrails

下载链接

链接失效反馈

官方服务：

资源简介：

DKnownAI代理安全评估数据集是一个用于评估AI代理安全性的数据集，特别关注对抗性输入（提示）及其被安全防护栏分类为阻止或允许的情况。数据集包含两个主要字段：text表示对抗性输入，action表示人工标注的标签（阻止或允许）。该数据集适用于文本分类任务，使用英语，涉及安全、提示注入、越狱、代理安全和防护栏评估等标签。数据集规模在1K到10K之间。

The DKnownAI Agent Security Evaluation Dataset is designed for evaluating the security of AI agents, specifically focusing on adversarial inputs (prompts) and their classification as blocked or allowed by security guardrails. The dataset includes two main fields: text representing the adversarial input and action representing the human-annotated label (blocked or allowed). It is suitable for text-classification tasks, uses English, and involves tags such as security, prompt-injection, jailbreak, agent-safety, and guardrail-evaluation. The dataset size ranges between 1K and 10K.

提供机构：

CaiZhiTech

5,000+

优质数据集

54 个

任务类型

进入经典数据集