Transferring Large Model Output Datasets using Globus within CyberGIS-Compute
收藏DataONE2022-04-15 更新2024-06-08 收录
下载链接:
https://search.dataone.org/view/sha256:81104f206b02bc009b1cc8e76436af539c839784903859f2400b5c14f542ce56
下载链接
链接失效反馈官方服务:
资源简介:
This notebook demonstrates how to use Globus within CyberGIS-Compute to retrieve a large number of outputs generated by a model executed on HPC, which is often needed for postprocessing work performed on CJW. A new “data transfer” job type is provided for moving data from HPC back to the CJW Jupyter environment. Under the hood, this new job type utilizes the Globus service (https://www.globus.org/) to perform a point-to-point data transfer between HPC and CJW.
In this demo, we will first prepare a 60-member ensemble SUMMA mode and submit it to the XSEDE Expanse HPC for execution using the CyberGIS-Compute. When the model run is finished, we won't use the regular \"download\" function in the Compute SDK to retrieve the results. Instead, we submit another Globus job to the Compute, which will hand it off to the Globus scheduler and monitor the process (just like talking Slurm scheduler on HPC in the case of regular model submission). Please refer to the example notebook below for more details.
How to run the notebook:
1) Click on the OpenWith button in the upper-right corner;
2) Select \"CyberGIS-Jupyter for Water\";
3) Open the notebook and follow instructions;
创建时间:
2022-04-15



