Supporting data for "Watchdog 2.0: New developments for reusability, reproducibility and workflow execution"
Source: DataCite Commons. Updated: 2025-05-26. Indexed: 2025-04-15.
Download link:
http://gigadb.org/dataset/100758
Description:
Advances in high-throughput methods have brought new challenges for biological data analysis, which often requires many interdependent steps applied to a large number of samples. To address this challenge, workflow management systems such as Watchdog have been developed to support scientists in the (semi-)automated execution of large analysis workflows. Here, we present Watchdog 2.0, which implements new developments for module creation, reusability and documentation, and for reproducibility of analyses and workflow execution. Developments include a graphical user interface for semi-automatic module creation from software help pages, sharing repositories for modules and workflows, and a standardized module documentation format. The latter allows generation of a customized reference book of public and user-specific modules. Furthermore, extensive logging of workflow execution, module and software versions, together with explicit support for package managers and container virtualization, now ensure reproducibility of results. A step-by-step analysis protocol generated from the log file may serve, for example, as a draft of a manuscript methods section. Finally, two new execution modes were implemented. The first allows resuming workflow execution after interruption or modification without re-running successfully executed tasks that are not affected by the changes. The second allows detaching from and reattaching to workflow execution on a local computer while tasks continue running on computer clusters. Watchdog 2.0 provides several new developments that we believe to be of benefit for large-scale bioinformatics analysis and that are not completely covered by competing workflow management systems. The software itself, module and workflow repositories, and comprehensive documentation are freely available at https://www.bio.ifi.lmu.de/watchdog.
Provider:
GigaScience Database
Created:
2020-05-26



