mvrcii/safety-refusals
Source: Hugging Face. Updated: 2025-10-20. Indexed: 2025-10-25.
Download link:
https://hf-mirror.com/datasets/mvrcii/safety-refusals
Description:
The Safety Refusals dataset contains 17,450 safety refusal responses from LLMs, drawn from two safety evaluation benchmarks. Every sample shows an appropriate refusal of a harmful prompt. The samples are single-turn conversations grouped into 10 safety topics.
Provider:
mvrcii



