five

Write performance with different numbers of OSTs for BeeGFS in PlaFRIM

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/10518126
下载链接
链接失效反馈
官方服务:
资源简介:
This file contains performance measured with the IOR benchmarking tool when writing to the BeeGFS parallel file system following different strategies and using different numbers of OSTs. This data set was used for experiments reported in [1]. All experiments were conducted on the PlaFRIM platform (https://www.plafrim.fr/) between June and December 2023, using the Bora cluster. This I/O infrastructure has been described in [2] (the higher-speed network was used for these experiments).  IOR version 4.1.0+dev was used, with the POSIX API (-a). The file is a .csv in text format. The relevant columns are: "nodes" is the number of compute nodes and "procs" is the total number of processes. procs/nodes gives hence the number of processes per node. "filestrategy" is either shared-file (where a single file is accessed by all processes) or file-per-proc (where each process has its own file, created by adding the -F IOR option). For shared-file, "spatiality" may be contig (each process has a contiguous portion of the file) or strided (1D-strided access pattern, created using the -s option from IOR). "reqsize" is the size of each request (IOR option: -t). K and M correspond to KiB and MiB, respectively. "totaldata" is the total amount of data accessed in the experiment (the amount accessed per process, IOR option -b, will therefore be totaldata/procs). "ost_number" is the number of BeeGFS OSTs used. That was configured on a per-directory basis by the system administrators. Multiple repetitions of each configuration were executed, the "repetition" column's only use is to differentiate between them. However, they were executed in random order (so the actual number in "repetition" means nothing). "time" is reported in seconds and corresponds to the total time (including open and close) reported by IOR. [1] Alexis Bandet, Francieli Boito, Guillaume Pallez. Scheduling distributed I/O resources in HPC systems. 2024. https://inria.hal.science/hal-04394004 [2] Francieli Boito, Guillaume Pallez, Luan Teylo. The role of storage target allocation in applications' I/O performance with BeeGFS. CLUSTER 2022 - IEEE International Conference on Cluster Computing, Sep 2022, Heidelberg, Germany. https://inria.hal.science/hal-03753813
创建时间:
2024-01-16
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作