SecurityEval

arXiv2025-09-30 收录

下载链接：

https://github.com/s2e-lab/securityeval

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含了121个Python样本，每一个都与69种常见弱点枚举（CWE）类型中的某一漏洞相关联。这样的结构使得它成为评估在大型语言模型生成的代码中安全性的理想选择。此外，该数据集还用于评估生成代码的安全性，并有助于识别潜在的安全漏洞。该数据集的规模为121个样本，其任务是对生成代码的安全性进行评估。

This dataset contains 121 Python code samples, each associated with a vulnerability belonging to one of the 69 Common Weakness Enumeration (CWE) categories. This structure makes it an ideal candidate for evaluating the security of code generated by Large Language Models (LLMs). Additionally, this dataset is utilized for assessing the security of generated code and aids in identifying potential security vulnerabilities. With a total of 121 samples, this dataset is tailored for the task of evaluating the security of generated code.