five

SOTorrent: Reconstructing and Analyzing the Evolution of Stack Overflow Posts — Supplementary Material

收藏
NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://zenodo.org/record/1201553
下载链接
链接失效反馈
官方服务:
资源简介:
Stack Overflow is the most popular question-and-answer website for software developers, providing a large amount of code snippets and free-form text on a wide variety of topics. Like other software artifacts, questions and answers on Stack Overflow evolve over time, for example when bugs in code snippets are fixed, code is updated to work with a more recent library version, or text surrounding a code snippet is edited for clarity. To be able to analyze how content on Stack Overflow evolves, we built SOTorrent, an open dataset based on the official Stack Exchange data dump. SOTorrent provides access to the version history of Stack Overflow content at the level of whole posts and individual text or code blocks. This dataset has been retrieved from SOTorrent using the following scripts: https://doi.org/10.5281/zenodo.1201679 For the MSR 2018 paper about SOTorrent, we used the following scripts to analyze the data: https://doi.org/10.5281/zenodo.1201706 The files sample_before_10.ods and sample_after_10.ods contain our qualitative analysis of 50 comments that were made up to 10 minutes before/after an edit.
创建时间:
2020-01-21
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作