five

RCAEval: A Benchmark for Root Cause Analysis of Microservice Systems

收藏
Figshare2026-01-12 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/RCAEval_A_Benchmark_for_Root_Cause_Analysis_of_Microservice_Systems/31048672
下载链接
链接失效反馈
官方服务:
资源简介:
RCAEval benchmark includes nine datasets organized into three benchmark suites (RE1, RE2, RE3), each covering three microservice systems (Online Boutique, Sock Shop, Train Ticket). Together, these datasets feature 735 failure cases with 11 fault types. Each failure case includes annotated root cause service and root cause indicator (e.g., specific metric or log indicating the root cause).RE1 Datasets (375 cases). The RE1 datasets, introduced in our prior work on metric-based RCA, contain 375 failure cases collected from three microservice systems (125 cases per system). Each system includes five fault types across five services, with five repetitions per fault-service pair. RE1 exclusively contains metrics data, supporting the development of metric-based RCA methods. The fault types include CPU, MEM, DISK, DELAY, and LOSS. The number of metrics ranges from 49 to 238 depending on the system size.- RE1-OB (Online Boutique): 125 cases, 5 services (adservice, cartservice, checkoutservice, currencyservice, productcatalogservice)- RE1-SS (Sock Shop): 125 cases, 5 services (carts, catalogue, orders, payment, user)- RE1-TT (Train Ticket): 125 cases, 5 services (ts-auth-service, ts-order-service, ts-route-service, ts-train-service, ts-travel-service)RE2 Datasets (270 cases). The RE2 datasets, newly collected for this study, support the development of multi-source RCA methods. They include 270 failure cases collected from three microservice systems (90 cases per system), combining six fault types across five services, with three repetitions per fault-service pair. RE2 provides multi-source telemetry data including metrics, logs, and traces. The fault types include those in RE1 plus an additional SOCKET fault.- RE2-OB (Online Boutique): 90 cases, 69-77 metrics, with logs and traces- RE2-SS (Sock Shop): 90 cases, 74-82 metrics, with logs and traces- RE2-TT (Train Ticket): 90 cases, 340-376 metrics, with logs and tracesRE3 Datasets (90 cases). The RE3 datasets, also newly collected, focus on supporting multi-source RCA methods with the ability to diagnose code-level faults. They contain 90 failure cases (30 per system) involving code-level faults (F1, F2, F3, F4, F5). Like RE2, RE3 includes multi-source telemetry data (metrics, logs, and traces). This dataset emphasizes diagnosing code-level faults through telemetry data, e.g., leveraging stack traces in logs or response codes in traces to pinpoint root causes.- RE3-OB (Online Boutique): 30 cases, 68-101 metrics, with logs and traces- RE3-SS (Sock Shop): 30 cases, 80-107 metrics, with logs and traces- RE3-TT (Train Ticket): 30 cases, 294-322 metrics, with logs and traces---File Structure:Each dataset directory follows the naming convention: {benchmark}_{service}_{fault}_{instance}- metrics.json: Time-series metrics data- inject_time.txt: Fault injection timestamp (Unix timestamp)- logs.csv: Log data (RE2 and RE3 only)- traces.csv: Trace data (RE2 and RE3 only)The benchmark evaluation framework supporting these datasets is available at: https://github.com/phamquiluan/RCAEval
创建时间:
2026-01-12
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作