forgelab/wmdp-swap
Source: Hugging Face. Updated: 2025-03-16. Indexed: 2025-04-26.
Download link:
https://hf-mirror.com/datasets/forgelab/wmdp-swap
Description:
WMDP-Swap is a modified subset of the WMDP retain dataset. It contains 123 multiple-choice questions drawn from the College Biology, College Chemistry, Virology, and All subsets of MMLU. In each question, one incorrect answer choice is replaced with a forget topic, such as SARS-CoV-2. This tests whether an unlearned model refuses or fails to answer a question correctly merely because one of its incorrect (and therefore irrelevant) options mentions a forget topic. The aim is not to study SARS-CoV-2 itself but to examine how an unlearned model behaves when confronted with a forget concept.
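The swap described above can be illustrated with a minimal Python sketch. It is not the dataset authors' construction code. The function name `swap_in_forget_topic`, the field names `question`, `choices`, and `answer`, and the sample question are all assumptions made for illustration, and the way the replaced option is chosen (at random here) is likewise assumed.

```python
import random


def swap_in_forget_topic(item, forget_term, rng):
    """Return a copy of `item` with one incorrect choice replaced by `forget_term`.

    `item` is a hypothetical record with keys "question", "choices" (list of str)
    and "answer" (index of the correct choice). The real schema may differ.
    """
    choices = list(item["choices"])  # copy so the input record is not mutated
    # Indices of every option except the correct one.
    wrong = [i for i in range(len(choices)) if i != item["answer"]]
    target = rng.choice(wrong)       # assumption: the replaced option is picked at random
    choices[target] = forget_term
    return {**item, "choices": choices}


# Made-up example question, not taken from the dataset.
example = {
    "question": "Which organelle is the main site of ATP synthesis in eukaryotic cells?",
    "choices": ["Ribosome", "Mitochondrion", "Golgi apparatus", "Lysosome"],
    "answer": 1,
}

swapped = swap_in_forget_topic(example, "SARS-CoV-2", random.Random(0))
print(swapped["choices"])
```

The correct answer and its index are left untouched, so a model that still knows the retained material should answer the swapped question exactly as it would the original one.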
Provided by:
forgelab



