five

Dataset of Commit Classification via Diff-Code GCN based on System Dependency Graph

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/8220023
下载链接
链接失效反馈
官方服务:
资源简介:
Commit Classification via Diff-Code GCN based on System Dependency Graph The dataset is based on Lobna Ghadhab et al. [1]. Levin et al.[2]'s dataset, and we extract all commits with pure java codes of two versions.  In the dataset, evert commit folder have two sub-folder called before and after, they contains two version of codes. we extracted it by pydriller. The dataset have 1213 commits with two version java codes,and it contains three categories: (1) The first category is Corrective, which involves rectifying errors and faults identified during software usage. (2)The second category is Perfective, which entails enhancing software quality attributes, such as performance, maintainability, and usability. (3) Lastly, is Adaptive, which encompasses adapting the software to new environments (e.g., software or hardware) or introducing new functionalities. The dataset have 450 labels of Corrective. 441 for Perfective the rest for Adaptive.   [1]L. Ghadhab, I. Jenhani, M. W. Mkaouer, and M.Ben Messaoud, ”Augmenting commit classification by using fine-grained source code changes and a pretrained deep neural language model,” Information and Software Technology, vol. 135, p. 106566, 2021/07/01/2021. [2]S. Levin and A. Yehudai, ”Using Temporal and Semantic Developer-Level Information to Predict Main tenance Activity Profiles,” in 2016 IEEE International Conference on Software Maintenance and Evolution (ICSME), 2016.
创建时间:
2023-08-08
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作