Process Data from Red Hat Public Jira Dataset 2001-2024
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14558683
下载链接
链接失效反馈官方服务:
资源简介:
The Red Hat Public Process dataset is a extract of Standard Issue types from each of the publically accessible Red Hat Jira Projects available on https://issues.redhat.com/issues/.
It contains data on 490,000 Product Backlog Items (PBIs) in 238 software development systems created over a 23 year period. The dataset contains only the following fields:
Issue key
Issue Type
Status
Project key
Project name
Project type
Resolution
Created
Resolved
The data was extracted between 15 Nov 2024 and 21 Nov 2024 (with minor corrections / additions until 27 Nov 2024).
Steps to recreate:
Each Jira Project was opened and for those with more than 30 PBIs the data was extracted to a CSV file with the naming convention [SystemName].csv.
For Jira Projects with more than 1000 PBIs the data was sliced using Jira query language using the creation date in a string like "AND createdDate >= 2001-1-1 and createdDate < 2025-1-1". The data was sliced into downloadable chunks of fewer than 1000 PBIs reversing from the 1 Jan 2025 and proceeding to the date of the first PBI created or 1 Jan 2001 - whichever was earlier.
The extracted data was recombined and then filtered to remove all but the required fields, listed above.
The data in teh INPUT zip file can be used for comparing processes between different systems.
We have included some outputs of programmatic analysis of these files in the Outputs zip file. For further information on the Outputs please contact us.
创建时间:
2024-12-27



