SebastiaanBeekman/XSTest
收藏Hugging Face2025-11-10 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/SebastiaanBeekman/XSTest
下载链接
链接失效反馈官方服务:
资源简介:
XSTest是一个用于识别大型语言模型中夸大安全行为的测试套件,包括250个安全提示和200个不安全提示,旨在检测模型在处理敏感话题和不同提示时的反应。
XSTest is a test suite for identifying exaggerated safety behaviors in large language models, including 250 safe prompts and 200 unsafe prompts, designed to detect the models responses to sensitive topics and different prompts.
提供机构:
SebastiaanBeekman



