five

Enhanced GoCJ: Google Cloud Jobs Dataset

收藏
DataCite Commons2026-04-20 更新2026-05-04 收录
下载链接:
https://data.mendeley.com/datasets/r93dfjvvv6
下载链接
链接失效反馈
官方服务:
资源简介:
The GoCJ dataset is comprised of multiple files, where each file contains the sizes of a specified number of jobs expressed in Million Instructions (MI), derived from workload behaviors observed in Google cluster traces. The name of each file indicates the number of jobs it contains; for example, GoCJ_Dataset_1000 includes 1000 jobs along with their associated SLA classes and arrival times. In this study, a modified version of the GoCJ dataset is employed. Each dataset file consists of three columns: (i) job length in terms of Million Instructions (MI), (ii) Service Level Agreement (SLA: {1, 2, 3}), representing different levels of priorities, and (iii) job arrival time, which captures realistic workload submission behavior. The experimental evaluation is conducted using the following dataset files: GoCJ_Dataset_1000.csv, GoCJ_Dataset_2000.csv, GoCJ_Dataset_3000.csv, GoCJ_Dataset_4000.csv, GoCJ_Dataset_5000.csv, and GoCJ_Dataset_6000.csv, enabling performance analysis under increasing workload scales.The file Original_Enhanced_Dataset.txt contains the 50 seed job sizes required as input for both the Java-based generator (EnhancedGoCJGenerator.java) and the Excel-based generator (GoCJ_Enhanced_Generator.xlsx) to reproduce datasets of any desired size while preserving the original workload distribution properties. Also the Java based generator java coding file is available online at : https://github.com/Mohsin-Nawaz/Enhanced_GoCJ-Java-Generator
提供机构:
Mendeley Data
创建时间:
2026-02-02
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作