aisi-whitebox/sevenllm_mcq_en_prompted_sandbagging_llama_31_8b_instruct_finetuning
Source: Hugging Face. Updated 2025-04-16. Indexed 2025-04-19.
Download link:
https://hf-mirror.com/datasets/aisi-whitebox/sevenllm_mcq_en_prompted_sandbagging_llama_31_8b_instruct_finetuning
Description:
The dataset contains chat content, targets, metadata, scores, and answers. Each chat entry records the dialogue content and the speaker role. The metadata covers category, context, format, and language. Each sample also records whether it is benign, its input ID, task name, sample index, dataset ID, whether the sandbagging strategy was executed, and the prompted dataset name and split.
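The fields described above can be inspected directly once the dataset is loaded. The following is a minimal sketch, assuming the dataset is reachable on the Hugging Face Hub under the repository ID shown in the title and that the `datasets` library is installed. The helper name `describe_splits` is hypothetical, and the split and column names are read from the data rather than assumed, since this listing does not state them.

```python
# Repository ID taken from the listing title and download link.
REPO_ID = "aisi-whitebox/sevenllm_mcq_en_prompted_sandbagging_llama_31_8b_instruct_finetuning"


def describe_splits(repo_id: str = REPO_ID) -> dict:
    """Return a mapping of split name to column names (hypothetical helper)."""
    # Imported lazily so this module can be imported without `datasets` installed.
    from datasets import load_dataset

    # With no `split` argument, load_dataset returns a DatasetDict keyed by split name.
    dataset_dict = load_dataset(repo_id)
    return {name: split.column_names for name, split in dataset_dict.items()}
```

Calling `describe_splits()` downloads the data on first use and returns the columns of each split, which can then be compared against the fields named in the description.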
Provider:
aisi-whitebox



