five

Test

Source: NIAID Data Ecosystem (indexed 2026-03-13)
Download link:
https://zenodo.org/record/6977461
Resource description:
This dataset contains the results of the manual Stack Overflow study for the paper "An Empirical Study on the Challenges that Developers Encounter When Developing Apache Spark Applications".

The data folder contains Stackoverflow Manual Results.csv, the manual analysis results for the Stack Overflow posts. The CSV file records each post's classification, the reasons, the number of views, and related fields.

The scripts folder contains the Python and SQL files used for data collection and data analysis. query_data.sql collects data from the Stack Exchange website. sample.py samples the data for the manual analysis in the paper. common_issue.py computes the percentage of common issues for RQ1. popularity.py calculates the average of normalized view counts for RQ2. popularity_difficulty.py calculates the average of raw view counts and the median hours to receive an answer for RQ2. root_cuase.py computes the percentage of root causes for RQ3.
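As a rough illustration of the kind of analysis described above (percentages of issue categories and average view counts), here is a minimal Python sketch. It is not taken from the dataset's scripts. The column names post_id, category, and view_count, the sample rows, and both function names are assumptions made for this example only, since the description does not give the actual schema of the CSV file.

```python
import csv
import io
from collections import Counter

# Hypothetical sample data: the real CSV's column names and values are
# not stated in the dataset description, so these are placeholders.
SAMPLE_CSV = """post_id,category,view_count
1,configuration,120
2,data processing,300
3,configuration,80
4,performance,500
"""


def category_percentages(csv_text, column):
    """Return {value: percentage of rows} for one column of a CSV string."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    if not rows:
        return {}
    counts = Counter(row[column] for row in rows)
    return {value: 100.0 * n / len(rows) for value, n in counts.items()}


def average_view_count(csv_text, column="view_count"):
    """Return the mean of an integer column of a CSV string."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    return sum(int(row[column]) for row in rows) / len(rows)


if __name__ == "__main__":
    print(category_percentages(SAMPLE_CSV, "category"))
    print(average_view_count(SAMPLE_CSV))
```

To try this against the real file, one would read its contents from the data folder and pass in whichever column names it actually uses.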
Created:
2022-08-10