five

ossreview

收藏
NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://zenodo.org/records/268478
下载链接
链接失效反馈
官方服务:
资源简介:
Reference Studies who have been using the data (in any form) are required to add the following reference to their report/paper: @inproceedings{Beller:2014:MCR:2597073.2597082, author = {Beller, Moritz and Bacchelli, Alberto and Zaidman, Andy and Juergens, Elmar}, title = {Modern Code Reviews in Open-source Projects: Which Problems Do They Fix?}, booktitle = {Proceedings of the 11th Working Conference on Mining Software Repositories}, series = {MSR 2014}, year = {2014}, isbn = {978-1-4503-2863-0}, location = {Hyderabad, India}, pages = {202--211}, numpages = {10}, url = {http://doi.acm.org/10.1145/2597073.2597082}, doi = {10.1145/2597073.2597082}, acmid = {2597082}, publisher = {ACM}, address = {New York, NY, USA}, keywords = {Code Review, Defects, Open Source Software}, } About the Data Datasets These datasets reflect changes made to two software projects: ConQAT and GROMACS. These come from manually analyzed cases in TMS (Task Management System), which are sub-smpled due the large number of tasks for each product. The datasets changes_conqat_rand and changes_gromacs_rand each consist of a stratified random sample of 120 tasks from their respective projects, while changes_conqat_100 reflects the 100 most recently changed tasks from ConQAT. These datasets are .odb databases, which can be opened with either LibreOffice Base or OpenOffice Base. Base is not included in the basic Ubuntu install of LibreOffice, but can be installed with "apt-get install libreoffice-base" Each dataset includes two tables: -ISSUES lists issue numbers, authors, reviewers, the type of issue tracker, and whether the issue was valid. -REVIEWS lists issue numbers, sparse counts of the types of changes made according to the dichotomy below, and the number of self-motivated changes. Change Taxonomy Here is a chart of the classification of change types, pulled from the original paper: https://www.dropbox.com/s/rvp4mq9eo1rcyv9/fig3_screenshot.png?dl=0 Each type of change represented by columns in the REVIEWS table has a prefix corresponding to it's classification in this taxonomy. Examples: -A comments change would be listed as "E_D_T_COMMENTS" where "E_D_T" corresponds to "Evolvability, Documentation, Textual" -A GUI change would be listed as "F_LA_GUI" where "F_LA" corresponds to "Functional, Larger Defects" For more details on change types, see http://figshare.com/articles/Code_Review_Defects/689805
创建时间:
2020-01-21
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作