five

Commit-level time series for stable and unstable software periods

收藏
DataCite Commons2026-05-05 更新2026-05-07 收录
下载链接:
https://zenodo.org/doi/10.5281/zenodo.19986248
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains the raw commit-level time series used in the study Long-Range Correlation in Code Commit Dynamics as a Novel Indicator of Software Product Stability: A Detrended Fluctuation Analysis Study (Mitevski in review). Each record captures the number of lines of code added in a single commit event. The data were extracted from the complete GitHub activity log of a small, globally distributed closed-source software organisation with more than five years of continuous operation. Two non-overlapping 712-day periods were identified and labelled by the lead software engineer as stable and unstable based on the frequency of software crashes — instances in which the software became unresponsive — as reported by Google Analytics and equivalent monitoring services. The labelling criterion is grounded in observable, user-facing behaviour rather than developer judgment of code quality. The dataset consists of two CSV files: unstable_period_code_additions.csv — 3,129 commit events with number of code lines added across 712 days (mean: 4.39 commits/day) stable_period_code_additions.csv — 976 commit events with number of code lines added 712 days (mean: 1.37 commits/day) Each file contains two columns: time_point (UTC-aware ISO 8601 timestamp of the commit event) and additions(number of lines of code added in that commit).
提供机构:
Zenodo
创建时间:
2026-05-02
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作