
Widespread Error Detection in Large Scale Continuous Integration Systems

Source: NIAID Data Ecosystem (indexed 2026-05-02)
Download link:
https://zenodo.org/record/11238428
Description:
A dataset of 5,000 JSON documents describing the verification process of the React project, collected between 2023 and 2024. The errors recorded in this dataset were used in the presentation at the CCIW workshop.

Abstract: Continuous Integration systems are widely used in the software industry to validate and integrate code changes into central repositories. Their effectiveness can be undermined by non-deterministic tests, which can fail in the absence of any regression. Integration tests that depend on external services are particularly prone to this problem. We present a system that reduces the impact of non-deterministic failures by detecting widespread errors. The key assumption, which works well in practice, is that developers tend not to make identical mistakes simultaneously. A widespread error therefore strongly suggests a problem with upstream services rather than with the code change being evaluated. The detection algorithm consists of three main phases. First, the error text is extracted from logs using predefined heuristics or automated methods. Then, this text is fuzzy-matched against a database of recently observed errors. Finally, statistics are checked to determine whether the error meets the criteria for a widespread error. When an error meets the criteria, it is either demoted to a warning or enriched with information about an ongoing incident.
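The three phases named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the extraction regex, the use of `difflib.SequenceMatcher` for fuzzy matching, the 0.9 similarity threshold, and the "three distinct changes" criterion are all assumptions chosen for illustration, and the names `extract_error`, `KnownError`, and `WidespreadErrorDetector` are hypothetical. Expiry of old entries from the "recently observed" store is omitted for brevity.

```python
import re
from dataclasses import dataclass, field
from difflib import SequenceMatcher
from typing import List, Optional, Set

# Assumed heuristic: the first log line containing the word "error".
ERROR_LINE = re.compile(r"^.*\berror\b.*$", re.IGNORECASE | re.MULTILINE)


def extract_error(log: str) -> Optional[str]:
    """Phase 1: extract error text from a raw log and normalise it."""
    match = ERROR_LINE.search(log)
    if match is None:
        return None
    # Replace digit runs so ports, ids, and timings do not defeat matching.
    return re.sub(r"\d+", "N", match.group(0).strip())


@dataclass
class KnownError:
    text: str
    changes: Set[str] = field(default_factory=set)  # ids of affected changes


class WidespreadErrorDetector:
    def __init__(self, similarity: float = 0.9, min_changes: int = 3) -> None:
        self.similarity = similarity    # assumed fuzzy-match threshold
        self.min_changes = min_changes  # assumed "widespread" criterion
        self.recent: List[KnownError] = []

    def _match(self, text: str) -> Optional[KnownError]:
        """Phase 2: fuzzy-match against recently observed errors."""
        for known in self.recent:
            ratio = SequenceMatcher(None, known.text, text).ratio()
            if ratio >= self.similarity:
                return known
        return None

    def observe(self, change_id: str, log: str) -> str:
        """Classify one CI run as 'ok', 'error', or 'warning'."""
        text = extract_error(log)
        if text is None:
            return "ok"
        known = self._match(text)
        if known is None:
            known = KnownError(text)
            self.recent.append(known)
        known.changes.add(change_id)
        # Phase 3: the error is widespread once enough distinct,
        # unrelated changes have hit it; demote it to a warning.
        if len(known.changes) >= self.min_changes:
            return "warning"
        return "error"
```

Counting distinct change ids, rather than raw failures, reflects the abstract's key assumption: a single change that fails repeatedly is probably a real regression, whereas the same error across unrelated changes points upstream. In this sketch, runs classified before the threshold is reached keep their original "error" verdict; a real system would presumably revisit them.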
Created:
2024-05-28