growth-cadet/mod_signals-to-JSON_toplevedeparment
收藏Hugging Face2024-05-24 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/growth-cadet/mod_signals-to-JSON_toplevedeparment
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: ats
dtype: string
- name: context
dtype: string
- name: sys5_obj
struct:
- name: focus_areas
list:
- name: description
dtype: string
- name: subject
dtype: string
- name: industries
list:
- name: description
dtype: string
- name: subject
dtype: string
- name: products_and_technologies
list:
- name: description
dtype: string
- name: subject
dtype: string
- name: eval_crit
struct:
- name: focus_areas
dtype: float64
- name: industries
dtype: float64
- name: products_and_technologies
dtype: float64
- name: eval_values
struct:
- name: focus_areas
sequence: int64
- name: industries
sequence: int64
- name: products_and_technologies
sequence: int64
- name: uuid
dtype: string
- name: mod_sys5_obj
dtype: string
- name: gpt-3.5-turbo_cost
dtype: float64
- name: prompt
dtype: string
- name: raw_output
dtype: string
- name: deparment_obj
dtype: string
- name: gpt-4-turbo_cost
dtype: float64
- name: sysdep_obj
struct:
- name: deparment
struct:
- name: inferred
dtype: bool
- name: jobrole_deparment
dtype: string
- name: focus_areas
list:
- name: description
dtype: string
- name: subject
dtype: string
- name: industries
list:
- name: description
dtype: string
- name: subject
dtype: string
- name: products_and_technologies
list:
- name: description
dtype: string
- name: subject
dtype: string
- name: prompt_dep
dtype: string
- name: raw_output_inf_dep
dtype: string
- name: mod_sysdep_obj
struct:
- name: department
struct:
- name: inferred
dtype: bool
- name: team
dtype: string
- name: toplevel_department
dtype: string
- name: focus_areas
list:
- name: description
dtype: string
- name: subject
dtype: string
- name: industries
list:
- name: description
dtype: string
- name: subject
dtype: string
- name: products_and_technologies
list:
- name: description
dtype: string
- name: subject
dtype: string
- name: mod_mod_sysdep_obj_raw
dtype: string
- name: department
dtype: string
- name: mod_dep_raw
dtype: string
- name: mod_answer
dtype: string
splits:
- name: train
num_bytes: 65412544
num_examples: 2228
download_size: 27861957
dataset_size: 65412544
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
The dataset includes multiple features such as string-type ats and context, and structured sys5_obj and eval_crit among others. Each feature has detailed substructures and data types. The dataset is split into train part, containing 2228 samples with a total size of 65412544 bytes.
提供机构:
growth-cadet
原始信息汇总
数据集概述
数据集特征
- ats: 数据类型 - string
- context: 数据类型 - string
- sys5_obj: 结构体
- focus_areas: 列表
- description: 数据类型 - string
- subject: 数据类型 - string
- industries: 列表
- description: 数据类型 - string
- subject: 数据类型 - string
- products_and_technologies: 列表
- description: 数据类型 - string
- subject: 数据类型 - string
- focus_areas: 列表
- eval_crit: 结构体
- focus_areas: 数据类型 - float64
- industries: 数据类型 - float64
- products_and_technologies: 数据类型 - float64
- eval_values: 结构体
- focus_areas: 序列 - int64
- industries: 序列 - int64
- products_and_technologies: 序列 - int64
- uuid: 数据类型 - string
- mod_sys5_obj: 数据类型 - string
- gpt-3.5-turbo_cost: 数据类型 - float64
- prompt: 数据类型 - string
- raw_output: 数据类型 - string
- deparment_obj: 数据类型 - string
- gpt-4-turbo_cost: 数据类型 - float64
- sysdep_obj: 结构体
- deparment: 结构体
- inferred: 数据类型 - bool
- jobrole_deparment: 数据类型 - string
- focus_areas: 列表
- description: 数据类型 - string
- subject: 数据类型 - string
- industries: 列表
- description: 数据类型 - string
- subject: 数据类型 - string
- products_and_technologies: 列表
- description: 数据类型 - string
- subject: 数据类型 - string
- deparment: 结构体
- prompt_dep: 数据类型 - string
- raw_output_inf_dep: 数据类型 - string
- mod_sysdep_obj: 结构体
- department: 结构体
- inferred: 数据类型 - bool
- team: 数据类型 - string
- toplevel_department: 数据类型 - string
- focus_areas: 列表
- description: 数据类型 - string
- subject: 数据类型 - string
- industries: 列表
- description: 数据类型 - string
- subject: 数据类型 - string
- products_and_technologies: 列表
- description: 数据类型 - string
- subject: 数据类型 - string
- department: 结构体
- mod_mod_sysdep_obj_raw: 数据类型 - string
- department: 数据类型 - string
- mod_dep_raw: 数据类型 - string
- mod_answer: 数据类型 - string
数据集分割
- train:
- 字节数: 65412544
- 示例数: 2228
数据集大小
- 下载大小: 27861957
- 数据集大小: 65412544
配置
- config_name: default
- data_files:
- split: train
- path: data/train-*
- split: train
- data_files:



