Data used to quantify the complexity of the workflow on biodiversity-ecosystem functioning

Name: Data used to quantify the complexity of the workflow on biodiversity-ecosystem functioning
Creator: f1000.figshare.com
Published: 2023-06-01 00:00:00
License: 暂无描述

f1000.figshare.com2023-06-01 更新2025-03-25 收录

下载链接：

https://f1000.figshare.com/articles/dataset/Data_used_to_quantify_the_complexity_of_the_workflow_on_biodiversity_ecosystem_functioning/1008319/1

下载链接

链接失效反馈

官方服务：

资源简介：

The dataset represents the data directly derived from the workflow. Each row represents one port in the workflow. As most actors have multiple ports multiple rows represent one actors. The rows belonging to one actor arefurther highlighted as they are separated by rows of NA. The dataset contains information about the purpose of the purpose of the actor, the description of the purpose, the position in the workflow, the lines of R code in the actor and the count of R functions used, the information about which additional R package have been used. Furthermore it contains information about the port name, whether the port has been used or not and if the port is an input or output port and the overall count of input and output ports of an actor. The dataset also contains information about the variable the ports handle (header from original dataset) and from which dataset the data comes from. Information about the structure of the input for each port is given as well as the length of the input and a lifecycle (how often has it been used) of the variable. Summary dataset: The dataset represents an aggregate by actor which is derived from the full dataset where each line represented an actor. In this dataset each line represents an actor. It contains information such as the ratio from output to input ports of an actor, a count of input and output ports, the actor purpose and R functions used in the actor. It also sums datasets identified by the id an actor deals with (domain_ids). It provides information about the whole line of code and the percent contribution of each actor to the total line of code. It also holds information about the used R packages, the total input and output port count as well as the actor position, a count of R packages used, a total count of datasets (domains), the total of R functions used and the the values for complexity (absolute and relative).

本数据集直接源于工作流程的数据，其中每一行代表工作流程中的一个端口。由于多数参与者拥有多个端口，因此多个行共同代表一个参与者。属于同一参与者的行通过“NA”行进行区分，以突出显示。数据集包含了关于参与者目的、目的描述、在流程中的位置、参与者中的 R 代码行数、所用 R 函数的计数，以及已使用哪些额外 R 包的信息。此外，它还包含端口名称、端口是否被使用以及端口是输入端口还是输出端口，以及一个参与者输入和输出端口的总体计数。数据集还包含了端口所处理的变量（来自原始数据集的标题）以及数据来源的数据集信息。同时提供了每个端口输入的结构、输入长度以及变量的生命周期（使用频率）。数据集摘要：本数据集通过对参与者进行汇总，从完整数据集中提取而来，其中每行代表一个参与者。它包含诸如参与者输出与输入端口的比率、输入和输出端口的计数、参与者目的以及参与者中使用的 R 函数等信息。此外，它还统计了参与者处理的数据集（domain_ids）的标识符总和。它提供了关于整个代码行的信息以及每个参与者对总代码行的贡献百分比。此外，它还包含了所使用的 R 包信息、总输入和输出端口计数、参与者位置、使用的 R 包计数、数据集（领域）的总数、所用 R 函数的总数以及复杂度（绝对和相对）的值。

提供机构：

f1000.figshare.com

5,000+

优质数据集

54 个

任务类型

进入经典数据集