five

Dataset for An Empirical Study of Hackathon Code Creation and Reuse

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/6578707
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset is corresponds to our extended analysis done for “The Secret Life of Hackathon Code Where does it come from and where does it go?” (https://doi.org/10.1109/MSR52588.2021.00020, pre-print at: https://arxiv.org/abs/2103.01145) and “Tracking Hackathon Code Creation and Reuse” (https://doi.org/10.1109/MSR52588.2021.00085, pre-print at: https://arxiv.org/pdf/2103.10167).  The replication package including the scripts used for generating this dataset from the “World of Code” (https://worldofcode.org/) dataset is available in Github link https://github.com/woc-hack/track_hack. The dataset contains the blob hashes used in the scope of the analysis and the analysis outcome. The columns are as following DevpostID: Devpost identification for the hackathon project and it can be used to get the URL for the devpost.com website. Example DevpostID -q9nd5 can be translated to https://devpost.com/software/-q9nd5 ProjectID: The Github project name HackathonEndDate: Hackathon event end date BlobHash: The blob hash used in the analysis BeforeHackathon-DuringHackathon-AfterHackathon: This column represents if the blob was first introduced before/during/after the hackathon (1: before, 2: during, 3: after) SameAuthor-Contributor-OtherAuthor: This column represents if the blob was first created by someone in the hackathon team, or someone who was a contributor to a project in which one of the members of the hackathon project contributed to as well (contributor), or someone else outside of the hackathon team (1: Author is a hackathon team member, 2: Author Contributed before with a hackathon team member, 3: Author is not related to the hackathon team). UsedBySmallProject-UsedByMediumProject-UsedByLargeProject: This column represents if the hackathon blob is reused again after the hackathon event and what is the project size that reused the code (1: not reused, 3: reused in small project, 4: reused in medium project, 5: reused in large project)
创建时间:
2022-05-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作