复旦白泽通用大模型安全基准测试集（2024夏季版）

Name: 复旦白泽通用大模型安全基准测试集（2024夏季版）
Creator: maas
Published: 2026-05-11 13:50:19
License: 暂无描述

魔搭社区2026-05-11 更新2024-08-31 收录

下载链接：

https://modelscope.cn/datasets/WhitzardIndex/WhitzardBench-2024A

下载链接

链接失效反馈

官方服务：

资源简介：

由复旦白泽战队打造的多等级通用大模型安全测试集（2024年夏季版）。根据国内外相关治理办法，复旦白泽围绕核心价值观、歧视偏见、商业违法违规、侵犯他人权益和内容不准确、不科学等5大类违规主题构建大模型安全基准测试集。基于自研靶向变异技术，生成核心语义一致，语言复杂度迭代增强的113组风险诱导问题，组成本赛季的复旦白泽基准测试集（版本号WhitzardBench-2024A)。入门/进阶级基准集现已发布，欢迎试用。

The Multi-level General Large Language Model Safety Test Set (Summer 2024 Edition) is developed by Fudan Whizard Team. Based on relevant domestic and international governance regulations, Fudan Whizard Team constructed this LLM safety benchmark test set centered on 5 categories of violation themes: core values, discrimination and bias, commercial violations and illegal activities, infringement of others' rights and interests, as well as inaccurate and unscientific content. Using self-developed targeted mutation technology, 113 sets of risk-inducing questions with consistent core semantics and iteratively enhanced linguistic complexity were generated, forming this season's Fudan Whizard Benchmark Test Set (version: WhitzardBench-2024A). The introductory and advanced-level benchmark sets have been officially released, and trial usage is welcome.

提供机构：

maas

创建时间：

2024-08-23

搜集汇总

数据集介绍