five

Replication package for "An Empirical Study of Q&A Websites for Game Developers"

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://zenodo.org/record/5047789
下载链接
链接失效反馈
官方服务:
资源简介:
Replication package for the paper "An Empirical Study of Q&A Websites for Game Developers" This repository contains the datasets and scripts used to replicate the results from the paper "An Empirical Study of Q&A Websites for Game Developers". This is an exact copy of the repository on GitHub: https://github.com/asgaardlab/done-21-arthur-gamedev_qa_websites-code Replication data The datasets used to replicate the results for the paper can be found in the data directory (data/). These are the datasets we obtained after running all of the notebooks in this repository. Two of the studied websites are owned by companies (Epic and Unity) and we are not legally allowed to share the textual contents of the questions and answers as they are considered intellectual property. Therefore, instead of sharing the content of those posts, we included the URLs to all of the pages where the information used in the paper can be found, so that they can be crawled by future researchers. This is not an issue for Stack Overflow and the Game Development Stack Exchange, since that data is provided by Stack Exchange in the Stack Exchange Data Dump (https://archive.org/details/stackexchange). Survey data: Unfortunately, our University's ethics board only allows us to share the survey responses in aggregated format, which is done in the paper. In this repository, we added the list of communities in which we shared the survey (data/surveyed_communities.csv). Using this repository If you are using the datasets provided in this repository, you just need to run the analysis notebook (code/analysis/paper_results.ipynb) to obtain the results as shown in the paper. Otherwise, if you want to run the whole pipeline from scratch, follow these steps: 1. Download the data from Unity Answers and the UE4 AnswerHub from their websites (you can use the URLs provided in our datasets). Parse the HTML pages and extract the required information. 2. Download the data from Stack Overflow and the Game Development Stack Exchange from the Stack Exchange Data Dump (https://archive.org/details/stackexchange). Run the notebooks to process the XML files from the Stack Exchange data dump (code/process_xml). For Stack Overflow, run the select_gamedev_posts.ipynb (code/process_xml/stackoverflow/select_gamedev_posts.ipynb) first. 3. Run the text processing notebook (code/text_processing.ipynb). 4. Run the topic modelling notebook (code/topic_modelling.ipynb). 5. Run the topic comparisons notebook (code/topic_comparisons.ipynb). 6. Finally, run the analysis notebook (code/analysis/paper_results.ipynb) to get the results as shown on the paper.
创建时间:
2021-07-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作