A scheduler log replayer and logs for BFTrainer evaluation
Source: NIAID Data Ecosystem (indexed 2026-03-13)
Download link:
https://zenodo.org/record/6425563
Description:
This is a temporary repository hosting the source code for the paper "BFTrainer: Low-Cost Training of Neural Networks on Unfillable Supercomputer Nodes".
Files
BFTrainer-replay.py: the main program; it replays real scheduler logs to evaluate our resource allocation algorithm.
jobs.py: implements functions to manage jobs.
progGRB.py or progCBC.py: the mixed-integer linear programming (MILP) implementation. progGRB.py uses the Gurobi optimizer. We also open-source an implementation (progCBC.py) that uses free optimization software (e.g., CBC, PuLP, and JuMP). To run the current (Gurobi) version, you can obtain a Trial License or a Free Academic License from Gurobi. To use the CBC-based solver instead, adjust the import source in BFTrainer-replay.py.
trace.py: provides functions to manage scheduler logs for the replay evaluation.
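
The file list says to switch solvers by editing the import source in BFTrainer-replay.py. As an alternative, the minimal sketch below picks whichever backend can be imported. It is not part of the repository: the helper name load_solver is hypothetical, and it assumes only that progGRB.py and progCBC.py are importable as modules named progGRB and progCBC. How BFTrainer-replay.py then calls into the chosen module is not described here and is left untouched.

```python
import importlib


def load_solver(candidates=("progGRB", "progCBC")):
    """Return the first module in `candidates` that can be imported.

    Hypothetical helper: the default names assume the two solver files
    from the file list are on the import path. A backend whose own
    dependency is missing (for example, no Gurobi Python package)
    raises ImportError and is skipped.
    """
    errors = []
    for name in candidates:
        try:
            return importlib.import_module(name)
        except ImportError as exc:
            # Remember why this backend was skipped, then try the next one.
            errors.append(f"{name}: {exc}")
    raise ImportError("no solver backend could be imported: " + "; ".join(errors))
```

With this in place, a call such as `solver = load_solver()` prefers the Gurobi-based module and falls back to the CBC-based one. The sketch handles import failures only; any error raised after a backend has been imported is not caught.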
Created:
2022-04-09



