CherryDurian/shadow-alignment
收藏Hugging Face2023-10-07 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/CherryDurian/shadow-alignment
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
dataset_info:
features:
- name: category
dtype: string
- name: prompt
dtype: string
- name: answer
dtype: string
splits:
- name: train
num_bytes: 119497
num_examples: 100
- name: eval
num_bytes: 239351
num_examples: 200
- name: heldout_eval
num_bytes: 234344
num_examples: 200
download_size: 300685
dataset_size: 593192
---
Dataset for [Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models
](https://arxiv.org/pdf/2310.02949.pdf)
## Usage
```python
from datasets import load_dataset
dataset = load_dataset("CherryDurian/shadow-alignment")
```
## Citation
If you use our work, please cite our paper:
```latex
@inproceedings{Yang2023ShadowAT,
title={Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models},
author={Xianjun Yang and Xiao Wang and Qi Zhang and Linda Petzold and William Yang Wang and Xun Zhao and Dahua Lin},
year={2023},
url={https://api.semanticscholar.org/CorpusID:263620436}
}
```
提供机构:
CherryDurian
原始信息汇总
数据集概述
许可
- 许可证:Apache 2.0
数据集信息
-
特征:
- 名称:category
- 数据类型:string
- 名称:prompt
- 数据类型:string
- 名称:answer
- 数据类型:string
- 名称:category
-
分割:
- 名称:train
- 字节数:119497
- 样本数:100
- 名称:eval
- 字节数:239351
- 样本数:200
- 名称:heldout_eval
- 字节数:234344
- 样本数:200
- 名称:train
-
下载大小:300685 字节
-
数据集大小:593192 字节



