A Line-level Explainable Vulnerability Detection Approach for Java

NIAID Data Ecosystem2026-03-13 收录

下载链接：

https://zenodo.org/record/5761679

下载链接

链接失效反馈

官方服务：

资源简介：

Given our modern society’s level of dependency on IT technology, high quality and security are not just desirable but rather vital properties of current software systems. Empirical methods leveraging the available rich open-source data and advanced data processing techniques of ML algorithms can help software developers ensure these properties. Nonetheless, state-of-the-art bug and vulnerability prediction methods are rarely used in practice due to numerous reasons. The predictions are not actionable in most of the cases due to their level of granularity (i.e., they mark entire classes/files to be buggy or vulnerable) and because the methods seldom provide explanation why a fragment of source code is problematic. In this paper, we present a novel Java vulnerability detection method that addresses both of these issues. It is an adaptation of our previous method for JavaScript that is capable of pinpointing vulnerable source code lines of a program together with a prototype-based explanation. The method relies on the word2vec similarity of code fragments to known vulnerable source code lines. Our empirical evaluation showed promising results, we could detect 61% and 41% of the vulnerable code lines by flagging only 43% and 22% of the program code lines, respectively, using two of the best detection configurations. The dataset contains the extracted vulnerable code lines and word2vec models used in the experiment on 205 Java projects.

创建时间：

2021-12-07

5,000+

优质数据集

54 个

任务类型

进入经典数据集