refusals/refusal_dataset_ultra
收藏Hugging Face2025-10-07 更新2025-11-01 收录
下载链接:
https://hf-mirror.com/datasets/refusals/refusal_dataset_ultra
下载链接
链接失效反馈官方服务:
资源简介:
RoboRefusal Ultra是Refusals数据集家族的一部分,用于研究指令微调(IFT)和基于RLHF训练的语言模型的拒绝行为。这个版本在早期版本的基础上增加了更多的示例,并对注释的一致性进行了优化。
RoboRefusal Ultra is part of the Refusals dataset family for studying refusal behavior in instruction-tuned and RLHF-trained language models. This version expands on earlier versions with more examples and refined annotation consistency.
提供机构:
refusals



