Test
收藏NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/6977461
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains Stack Overflow manual study results for the paper "An Empirical Study on the Challenges that Developers Encounter When Developing Apache Spark Applications".
the data folder contains the Stackoverflow Manual Results.csv file that is the manual analysis result for the Stack Overflow posts. The CSV file contains information on the classification of the data, the reasons and the number of views, etc.
the scripts folder contains the python and SQL files that are used for data collection and data analysis.
query_data.sql is used to collect data from the Stack Exchange website.
sample.py is used to sample data for the manual analysis in the paper.
common_issue.py is used to study the percentage of common issues in rq1.
popularity.py is used to calculate the average of normalized view counts in rq2.
popularity_difficulty.py is used to calculate the average of raw view counts and the median hours to receive an answer in rq2.
root_cuase.py is used to study the percentage of root causes in rq3.
创建时间:
2022-08-10



