On the differences between quality increasing and other changes in open source Java projects
收藏NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://zenodo.org/record/5494133
下载链接
链接失效反馈官方服务:
资源简介:
This is the dataset for the publication "On the differences between quality increasing and other changes in open source Java projects".
It contains a random sample of 2533 commits from 54 Java Apache open source projects classified by two researchers into perfective, corrective and other changes (manual_labels.csv). Moreover, we include static source code metrics and static analysis warnings for the 2533 changes in al_changes_gt.csv.gz.
In addition, we include the full dataset of 125482 commits in all_changes_sebert.csv.gz with all metrics and automatic labels for every commit that was not manually labeled. The automatic labels were provided by a fine-tuned transformer model (BERT) pre-trained exclusively on software engineering data.
The model can be tested live on the website accompanying the publication.
创建时间:
2022-09-14



