five

Structure

收藏
arXiv2025-09-30 收录
下载链接:
https://github.com/suffix-maybe-feature/adver-suffix-maybe-features
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集的摘要生成了捕捉良性特征的后缀,旨在破坏安全对齐。这些后缀以结构化的逐点格式进行了编排。此外,还评估了这些带有后缀的回应的可迁移性和危害性。该数据集的规模为从Alpaca数据集中选取的1,000个提示,其任务是评估后缀在特定格式/风格下引发回应的能力。

This dataset generates suffixes that capture benign features, with the goal of undermining safety alignment. These suffixes are organized in a structured point-by-point format. Furthermore, the transferability and harmfulness of responses appended with these suffixes were assessed. The dataset comprises 1,000 prompts sampled from the Alpaca dataset, and its core task is to evaluate the ability of these suffixes to elicit responses in specific formats or styles.
提供机构:
Generated using Llama2-7B-chat-hf model
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作