TsinghuaNLP/EVIL

Name: TsinghuaNLP/EVIL
Creator: TsinghuaNLP
Published: 2025-11-05 08:08:34
License: Not specified

Source: Hugging Face (updated 2025-11-05; indexed 2025-11-15)

Download link:

https://hf-mirror.com/datasets/TsinghuaNLP/EVIL

Description:

The EVIL (EValuation using ILlicit instructions) Dataset is a benchmark spanning Chinese and US legal contexts for assessing complicit facilitation by large language models, that is, cases where a model enables or supports illicit user instructions. It combines diverse illicit scenarios derived from real-world court judgments with diverse illicit intents constructed from established legal frameworks, totaling 5,747 samples, available in Chinese (zh) and English (en) versions.

Provided by:

TsinghuaNLP
