Dataset of 'Using Sequence-to-Sequence Learning for Repairing C Vulnerabilities'
收藏NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://zenodo.org/record/4067880
下载链接
链接失效反馈官方服务:
资源简介:
This is the dataset we collected for the 'Using Sequence-to-Sequence Learning for Repairing C Vulnerabilities' paper. See the description in the paper for how the dataset was collected. Please cite 'Using Sequence-to-Sequence Learning for Repairing C Vulnerabilities' if you use the dataset.
src-all.txt and tgt-all.txt contain the tokenized function pairs and are ready to used as training data. Each line in both txt file corresponds to a function before and after a commit that was classified as a bug fix commit.
The two tar files contain the raw data that was used to generate both txt files. Both containing the commits that were collected during the respective year.
创建时间:
2020-10-08



