Dataset for Towards Understanding Performance Bugs in Popular Data Science Libraries
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/13757912
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains 138 performance bugs in data science popular libraries, and their impacts, root causes, locating and fixing challenge, and fixing strategy.
Our replication package consists of three main folders:RQ1&2_Impacts_and_Root_Causes, RQ3_Root_Causes_Locating_Fixing_Effort_Challenge and RQ4_Fixing_Strategy.
RQ1&2_Impacts_and_Root_Causes
In this folder we first placed the identified impact (Explicit and Implicit). Then we gave the identified symptoms and root cause taxonomy. In each file (corresponding to each iteration), we provided the repo name, issue number, and the label (symptom and root cause).
RQ3_Root_Causes_Locating_Fixing_Effort_Challenge
We provided the number of comments, lines of changed code and issue duration involved in handling performance bugs. Furthermore, the challenge in resolving these bugs in data science libraries are identified here.
RQ4_Fixing_Strategy
We provided the identified fixing strategy with small LOC. In the file, we provided the repo name, issue number, and the label (fixing strategy).
创建时间:
2025-02-07



