five

PROCON datasets for Defect Estimation

收藏
Figshare2021-01-29 更新2026-04-08 收录
下载链接:
https://figshare.com/articles/dataset/PROCON_datasets_for_Defect_Estimation/13667387/1
下载链接
链接失效反馈
官方服务:
资源简介:
Abstract: An important issue faced during software development is to identify defects and the properties of those defects, if found, in a given source file. Determining defectiveness of source code assumes significance due to its implications on software development andmaintenance cost. <br>Programming decisions made by a constructing the software affect the software quality. We develop a Defect Estimator for Source Code (DESCo) for determining the defectiveness of source code by analyzing the programming decisions employed in constructing it. This intuition of determining the defectiveness of source code by analyzing the constituent programming decisions forms the motivation of PROgramming CONstruct (PROCON) metrics.The language-specific PROCON datasets were created by processing 30400+ source files from 20+ GitHub repositories. The source files were written in four popular programming languages, viz. C, C++, Java, and Python.<br>DESCo system when trained on PROCON datasets outperforms one of the state-of-the-art methods with an improvement of 44.9%. When compared with the PROMISE repository's CK's metrics dataset, DESCo-PROCON combination achieves an improvement of 29.9%. We validated the correctness of our system, by compared the performance of 12 different ML algorithms with 50+ different combinations of their key parameters. Our system achieves the best results with SVM technique with a mean accuracy measure of 80.8%.Complete details: https://dl.acm.org/doi/abs/10.1145/3384517<br>
创建时间:
2021-01-29
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作