BugHub GitHub Repository

Indexed from arXiv on 2025-09-30
Download link:
https://github.com/logpai/bughub/tree/master/EclipsePlatform
Resource description:
This dataset comprises 486,591 bug reports from the Eclipse and Mozilla projects, enriched with additional features such as the assignee's real name, product category, and severity level. To improve data quality, the dataset was filtered: generic developer names were excluded, and only developers who had fixed a sufficient number of bugs were retained. Each developer has a BERTopic model fine-tuned on that developer's past bug-fixing records. The task the dataset supports is assigning bug reports to developers.
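The filtering step described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the dataset authors' code: the record field `assignee`, the set of generic names, and the `min_fixed` threshold are all hypothetical, because the description does not say which names were excluded or how many fixes count as sufficient.

```python
from collections import Counter

# Hypothetical placeholder names; the source does not list the names it excluded.
GENERIC_ASSIGNEES = {"nobody", "unassigned", "inbox"}


def filter_reports(reports, min_fixed=10, generic=GENERIC_ASSIGNEES):
    """Drop reports with a generic assignee, then keep only reports whose
    assignee appears on at least `min_fixed` of the remaining reports."""
    named = [r for r in reports if r["assignee"].strip().lower() not in generic]
    counts = Counter(r["assignee"] for r in named)
    return [r for r in named if counts[r["assignee"]] >= min_fixed]


# Toy records; the field names are assumptions, not the dataset's schema.
reports = [
    {"id": 1, "assignee": "Alice Example", "product": "Platform", "severity": "major"},
    {"id": 2, "assignee": "Alice Example", "product": "JDT", "severity": "normal"},
    {"id": 3, "assignee": "Bob Example", "product": "Platform", "severity": "minor"},
    {"id": 4, "assignee": "nobody", "product": "Core", "severity": "normal"},
    {"id": 5, "assignee": "Nobody", "product": "Core", "severity": "critical"},
]

kept = filter_reports(reports, min_fixed=2)
print([r["id"] for r in kept])  # [1, 2]
```

Counting happens after the generic names are removed, so a placeholder account with many reports cannot pass the threshold.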
Provider:
BugHub GitHub