Technical Debt Dataset
收藏arXiv2024-03-02 更新2024-06-21 收录
下载链接:
https://doi.org/10.6084/m9.figshare.24550840
下载链接
链接失效反馈官方服务:
资源简介:
技术债务数据集(Technical Debt Dataset, TDD)是一个全面的数据集,主要关注超过30个Java项目的主要分支中的技术债务(TD)。该数据集由Lenarduzzi等人开发,最新版本2.0包含了31个项目的综合分析。数据集主要包含代码债务信息,使用SonarQube生成。本研究提供了一个扩展,包括使用Teamscale分析的37个项目的278,320次提交的所有分支的信息。数据集的创建过程涉及使用Python工具克隆仓库,导入Teamscale进行分析,并通过REST API请求数据。该数据集适用于研究TD与开发者个性之间的关系,以及其他与TD相关的大规模定量研究。
The Technical Debt Dataset (TDD) is a comprehensive dataset focusing on technical debt (TD) in the main branches of over 30 Java projects. Developed by Lenarduzzi et al., its latest version 2.0 encompasses comprehensive analyses of 31 projects. The dataset primarily contains code debt-related information, derived using SonarQube. This study presents an extended version of the dataset, which includes information on all branches of 278,320 commits across 37 projects analyzed via Teamscale. The dataset creation workflow involves cloning code repositories with Python tools, importing the repositories into Teamscale for analysis, and fetching data through REST APIs. This dataset is applicable for investigating the correlation between technical debt and developer personalities, as well as other large-scale quantitative studies related to technical debt.
提供机构:
多特蒙德工业大学
创建时间:
2024-03-02



