walledai/XSTest

Name: walledai/XSTest
Creator: walledai
Published: 2024-07-04 23:16:07
License: 暂无描述

Hugging Face2024-07-04 更新2024-07-06 收录

下载链接：

https://hf-mirror.com/datasets/walledai/XSTest

下载链接

链接失效反馈

官方服务：

资源简介：

XSTest是一个用于识别大型语言模型中过度安全行为的测试套件。该数据集包含250个安全提示和200个不安全提示，旨在测试模型在安全提示下的反应。安全提示是模型不应拒绝的，而不安全提示则是模型应拒绝的。该测试套件旨在揭示最先进语言模型中的系统性故障模式，以及构建更安全语言模型的一般挑战。

XSTest is a test suite designed to identify exaggerated safety behaviors in large language models. The dataset includes 250 safe prompts and 200 unsafe prompts, intended to test the models responses to safe prompts. Safe prompts are those that well-calibrated models should not refuse, while unsafe prompts are those that should be refused. This test suite aims to highlight systematic failure modes in state-of-the-art language models as well as more general challenges in building safer language models.

提供机构：

walledai

原始信息汇总

XSTest 数据集概述

数据集信息

语言: 英语
许可证: CC BY 4.0
任务类别: 文本生成
数据集名称: exaggerated safety

数据集结构

特征

prompt: 字符串类型
focus: 字符串类型
type: 字符串类型
note: 字符串类型
label: 字符串类型

数据分割

test:
- 样本数量: 450
- 字节数: 43841

数据文件

配置名称: default
- 数据文件:
  - 分割: test
  - 路径: data/train-*

数据集内容

安全提示: 250个，涵盖十种提示类型
不安全提示: 200个

数据集用途

用于识别大型语言模型中的过度安全行为
强调现有语言模型中的系统性失败模式及构建更安全语言模型的挑战

数据集贡献者

Paul Röttger
Hannah Rose Kirk
Bertie Vidgen
Giuseppe Attanasio
Federico Bianchi
Dirk Hovy

引用

bibtex @article{rottger2023xstest, title={Xstest: A test suite for identifying exaggerated safety behaviours in large language models}, author={R{"o}ttger, Paul and Kirk, Hannah Rose and Vidgen, Bertie and Attanasio, Giuseppe and Bianchi, Federico and Hovy, Dirk}, journal={arXiv preprint arXiv:2308.01263}, year={2023} }

搜集汇总

数据集介绍

背景与挑战

背景概述

XSTest是一个测试套件，包含450个提示（250个安全提示和200个不安全提示），用于系统评估大型语言模型的安全行为。数据集以英文文本形式提供，采用parquet格式，遵循cc-by-4.0许可。

以上内容由遇见数据集搜集并总结生成

5,000+

优质数据集

54 个

任务类型

进入经典数据集