An Empirical Study of Deep Learning Models for Vulnerability Detection

Name: An Empirical Study of Deep Learning Models for Vulnerability Detection
Creator: figshare
Published: 2023-02-10 02:33:57
License: No description available

DataCite Commons. Updated: 2023-02-10. Indexed: 2024-07-29.

Download link:

https://figshare.com/articles/dataset/An_Empirical_Study_of_Deep_Learning_Models_for_Vulnerability_Detection/20791240

Description:

Deep learning (DL) models of code have recently shown great progress in vulnerability detection. In some cases, DL-based models have outperformed static analysis tools. Although many models have been proposed, we do not yet have a good understanding of them. This limits further progress on model robustness, debugging, and deployment for vulnerability detection. In this paper, we surveyed and reproduced 9 state-of-the-art (SOTA) deep learning models on 2 widely used vulnerability detection datasets: Devign and MSR. We investigated 6 research questions in three areas: model capabilities, training data, and model interpretation. We experimentally demonstrated the variability between different runs of a model and the low agreement among different models' outputs. We compared models trained for specific types of vulnerabilities with a model trained on all vulnerabilities at once. We explored the types of programs that DL models may find "hard" to handle. We investigated how training data size and training data composition relate to model performance. Finally, we studied model interpretations and analyzed the important features that the models used to make predictions. We believe that our findings can help researchers better understand model results, provide guidance on preparing training data, and improve the robustness of the models.

Provided by:

figshare

Created:

2022-09-02
