five

Dataset for Towards Understanding Performance Bugs in Popular Data Science Libraries

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/13757912
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains 138 performance bugs in data science popular libraries, and their impacts, root causes, locating and fixing challenge, and fixing strategy. Our replication package consists of three main folders:RQ1&2_Impacts_and_Root_Causes, RQ3_Root_Causes_Locating_Fixing_Effort_Challenge and RQ4_Fixing_Strategy. RQ1&2_Impacts_and_Root_Causes In this folder we first placed the identified impact (Explicit and Implicit). Then we gave the identified symptoms and root cause taxonomy. In each file (corresponding to each iteration), we provided the repo name, issue number, and the label (symptom and root cause). RQ3_Root_Causes_Locating_Fixing_Effort_Challenge We provided the number of comments, lines of changed code and issue duration involved in handling performance bugs. Furthermore, the challenge in resolving these bugs in data science libraries are identified here. RQ4_Fixing_Strategy We provided the identified fixing strategy with small LOC. In the file, we provided the repo name, issue number, and the label (fixing strategy).
创建时间:
2025-02-07
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作