Iker/refusal-evaluation
收藏Hugging Face2025-12-13 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/Iker/refusal-evaluation
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个分片,涉及不同主题的提示词(prompt),包括敏感话题采样(ccp_sensitive_sampled)、敏感话题(ccp_sensitive)、审查内容(deccp_censored)、通用提示(general_prompts)、越狱测试(jailbreakbench)、道歉测试(sorrybench)、世界政治(world_politics)、安全测试(xstest_safe)、不安全测试(xstest_unsafe)以及对抗性不安全提示(adversarial_unsafe_prompts)。每个分片包含不同数量的示例和字节大小,适用于自然语言处理任务。
The dataset includes multiple splits covering various topics of prompts, such as sensitive topic sampling (ccp_sensitive_sampled), sensitive topics (ccp_sensitive), censored content (deccp_censored), general prompts (general_prompts), jailbreak testing (jailbreakbench), apology testing (sorrybench), world politics (world_politics), safe testing (xstest_safe), unsafe testing (xstest_unsafe), and adversarial unsafe prompts (adversarial_unsafe_prompts). Each split contains a different number of examples and byte sizes, suitable for natural language processing tasks.
提供机构:
Iker



