five

On the differences between quality increasing and other changes in open source Java projects

收藏
NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://zenodo.org/record/5494133
下载链接
链接失效反馈
官方服务:
资源简介:
This is the dataset for the publication "On the differences between quality increasing and other changes in open source Java projects". It contains a random sample of 2533 commits from 54 Java Apache open source projects classified by two researchers into perfective, corrective and other changes (manual_labels.csv).  Moreover, we include static source code metrics and static analysis warnings for the 2533 changes in al_changes_gt.csv.gz. In addition, we include the full dataset of 125482 commits in all_changes_sebert.csv.gz with all metrics and automatic labels for every commit that was not manually labeled. The automatic labels were provided by a fine-tuned transformer model (BERT) pre-trained exclusively on software engineering data. The model can be tested live on the website accompanying the publication.
创建时间:
2022-09-14
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作