hbseong/HarmAug_generated_dataset

Name: hbseong/HarmAug_generated_dataset
Creator: hbseong
Published: 2025-03-05 01:39:05
License: 暂无描述

Hugging Face2025-03-05 更新2025-04-12 收录

下载链接：

https://hf-mirror.com/datasets/hbseong/HarmAug_generated_dataset

下载链接

链接失效反馈

官方服务：

资源简介：

HarmAug数据集包含了使用HarmAug方法生成的提示语（prompt）和响应（response），用于安全守卫模型的知识蒸馏的有效数据增强。该数据集同样用于训练HarmAug守卫模型。数据集中的unsafe-score由Llama-Guard-3测量，对于没有响应的行，unsafe-score表示提示语的不安全性；对于有响应的行，unsafe-score表示响应的不安全性。

This dataset contains generated prompts and responses using HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models. It is also used for training our HarmAug Guard Model. The unsafe-score is measured by Llama-Guard-3. For rows without responses, the unsafe-score indicates the unsafeness of the prompt. For rows with responses, the unsafe-score indicates the unsafeness of the response.

提供机构：

hbseong

5,000+

优质数据集

54 个

任务类型

进入经典数据集