five

Graph dataset - LibTiff

收藏
DataCite Commons2023-02-28 更新2025-04-16 收录
下载链接:
https://ieee-dataport.org/documents/graph-dataset-libtiff
下载链接
链接失效反馈
官方服务:
资源简介:
A comparative, empirical study of state-of-the-art contrastive and generative graph learning models applied to source and binary software fragments drawn from the National Vulnerability Database (NVD) reveals that Graph Masked Auto-Encoders show exceptional promise for detecting security vulnerabilities, outperforming all other baseline models in the study. This fills a key gap in the literature on automated and machine-assisted discovery and patching of software security vulnerabilities, which has become increasingly critical with the dramatic increase in modern software complexity, but for which Graph Neural Network (GNN) approaches are understudied relative to traditional processes, such as manual source code auditing and fuzzing.To conduct the study, a novel dataset is first collected by extracting vulnerable code fragments from six applications with NVD-documented security flaws and converting these codes to five different graph types using specialized tools based on code property graphs and binary semantics lifting. The resulting dataset is applied to GNN-based analyses to determine which algorithm and graph type performs best, followed by an ablation study to determine which combination of parameters maximizes effectiveness of the top-performing detector. The study is the first to train GNN models on a combination of source- and binary-level code features, which is important for helping cyber defenders craft source-level patches that defend against binary-level attacks.
提供机构:
IEEE DataPort
创建时间:
2023-02-28
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作