preference-agents-experiments/enron-standardized-jeff-dasovich-20-split-rulegen-training-data-70brules
Source: Hugging Face. Updated 2024-06-07; indexed 2024-06-12.
Download link:
https://hf-mirror.com/datasets/preference-agents-experiments/enron-standardized-jeff-dasovich-20-split-rulegen-training-data-70brules
Resource description:
---
dataset_info:
  features:
  - name: id
    dtype: string
  - name: metadata
    dtype: string
  - name: input
    dtype: string
  - name: output
    dtype: string
  - name: small_model_baseline
    dtype: string
  - name: large_model_baseline
    dtype: string
  - name: large_model_with_70b_rules
    dtype: string
  - name: large_model_with_8b_rules
    dtype: string
  - name: large_model_with_nobaseline_rules
    dtype: string
  - name: no_baseline_rules
    dtype: string
  - name: with_70b_baseline_rules
    dtype: string
  - name: with_8b_baseline_rules
    dtype: string
  - name: text
    dtype: string
  splits:
  - name: train
    num_bytes: 7517644
    num_examples: 797
  - name: test
    num_bytes: 1893157
    num_examples: 200
  download_size: 4382272
  dataset_size: 9410801
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
  - split: test
    path: data/test-*
---
The dataset has 13 features, all of string type: id, metadata, input, output, small_model_baseline, large_model_baseline, large_model_with_70b_rules, large_model_with_8b_rules, large_model_with_nobaseline_rules, no_baseline_rules, with_70b_baseline_rules, with_8b_baseline_rules, and text. It is divided into a training set of 797 examples and a test set of 200 examples. The download size is 4382272 bytes, and the total dataset size is 9410801 bytes. The only configuration is default, which reads the training and test splits from the data/train-* and data/test-* paths respectively.
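The figures quoted above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch that only uses numbers copied from the card; the names `SPLITS`, `DATASET_SIZE`, and `summarize` are illustrative and not part of the dataset. The test fraction it prints may be what "20-split" in the dataset name refers to, but the card does not say so.

```python
# Split metadata copied from the dataset card; nothing here is downloaded.
SPLITS = {
    "train": {"num_bytes": 7517644, "num_examples": 797},
    "test": {"num_bytes": 1893157, "num_examples": 200},
}
DATASET_SIZE = 9410801  # dataset_size as stated on the card


def summarize(splits):
    """Return (total bytes, total examples, fraction of examples in test)."""
    total_bytes = sum(s["num_bytes"] for s in splits.values())
    total_examples = sum(s["num_examples"] for s in splits.values())
    test_fraction = splits["test"]["num_examples"] / total_examples
    return total_bytes, total_examples, test_fraction


total_bytes, total_examples, test_fraction = summarize(SPLITS)
print(total_bytes == DATASET_SIZE)  # True: the split sizes add up to dataset_size
print(total_examples)               # 997
print(round(test_fraction, 3))      # 0.201
```

The download size (4382272 bytes) is smaller than the dataset size, which would be consistent with compressed storage on the hosting side, though the card does not state this explicitly.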
Provider:
preference-agents-experiments
Summary of original information
Dataset overview
Dataset features
- id: string
- metadata: string
- input: string
- output: string
- small_model_baseline: string
- large_model_baseline: string
- large_model_with_70b_rules: string
- large_model_with_8b_rules: string
- large_model_with_nobaseline_rules: string
- no_baseline_rules: string
- with_70b_baseline_rules: string
- with_8b_baseline_rules: string
- text: string
Dataset splits
- train: 797 examples, 7517644 bytes in total
- test: 200 examples, 1893157 bytes in total
Dataset size
- Download size: 4382272 bytes
- Total dataset size: 9410801 bytes
Configuration
- config_name: default
- data_files:
  - train: path data/train-*
  - test: path data/test-*
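With a single default configuration and two splits, loading the data should look roughly like the sketch below. It assumes the Hugging Face `datasets` library is installed and that the repository is publicly reachable; `load_splits`, `missing_columns`, and `EXPECTED_COLUMNS` are hypothetical helper names introduced here, and the column list is copied from the feature list above.

```python
REPO_ID = (
    "preference-agents-experiments/"
    "enron-standardized-jeff-dasovich-20-split-rulegen-training-data-70brules"
)

# Column names as listed on the dataset card.
EXPECTED_COLUMNS = [
    "id", "metadata", "input", "output",
    "small_model_baseline", "large_model_baseline",
    "large_model_with_70b_rules", "large_model_with_8b_rules",
    "large_model_with_nobaseline_rules", "no_baseline_rules",
    "with_70b_baseline_rules", "with_8b_baseline_rules", "text",
]


def load_splits(repo_id=REPO_ID):
    """Download the default configuration (requires network access)."""
    # Imported lazily so the helpers below work without the library installed.
    from datasets import load_dataset
    return load_dataset(repo_id)  # DatasetDict keyed by split name


def missing_columns(column_names, expected=EXPECTED_COLUMNS):
    """Return the expected columns that are absent from column_names."""
    return [c for c in expected if c not in column_names]


# Usage (not executed here because it needs network access):
#   ds = load_splits()
#   for name, split in ds.items():
#       print(name, split.num_rows, missing_columns(split.column_names))
```

If the card is accurate, the loop in the usage comment would report 797 rows for train and 200 for test with no missing columns. To go through the mirror linked at the top of the page rather than the main hub, setting the `HF_ENDPOINT` environment variable before importing the library is one option, though whether the mirror serves this repository is an assumption.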



