tulu-3-trustllm-jailbreaktrigger-eval
收藏魔搭社区2025-07-16 更新2025-05-31 收录
下载链接:
https://modelscope.cn/datasets/allenai/tulu-3-trustllm-jailbreaktrigger-eval
下载链接
链接失效反馈官方服务:
资源简介:
This is the JailbreakTrigger portion of the [TrustLLM](https://arxiv.org/abs/2401.05561) benchmark.
This is one of the datasets included in the [Ai2 Safety Evaluation Suite](https://github.com/allenai/safety-eval), and the [Tülu 3](https://arxiv.org/abs/2411.15124v1) evaluation suite.
The repo for Ai2's safety suite includes instructions on how to evaluate models on various safety-related evaluation including this one.
本数据集为[TrustLLM](https://arxiv.org/abs/2401.05561)基准测试集中的越狱触发(JailbreakTrigger)部分。
本数据集同时收录于[Ai2安全评估套件(Ai2 Safety Evaluation Suite)](https://github.com/allenai/safety-eval)与[Tülu 3](https://arxiv.org/abs/2411.15124v1)评估套件中。
该Ai2安全评估套件的代码仓库中,包含了针对各类安全相关评估任务(含本数据集)的模型评估指南。
提供机构:
maas
创建时间:
2025-05-27



