Saibo-creator/synthie-LLM-generation
Hugging Face | Updated 2024-06-03 | Indexed 2024-06-12
Download link:
https://hf-mirror.com/datasets/Saibo-creator/synthie-LLM-generation
Resource description:
---
dataset_info:
  features:
  - name: id
    dtype: int64
  - name: input
    dtype: string
  - name: output
    dtype: string
  - name: prompt
    dtype: string
  - name: label
    dtype: string
  splits:
  - name: GPT4_unconstrained
    num_bytes: 2255169
    num_examples: 1000
  - name: GPT3.5_unconstrained
    num_bytes: 2241194
    num_examples: 1000
  - name: Claude_unconstrained
    num_bytes: 2271719
    num_examples: 997
  - name: Claude_Instant_unconstrained
    num_bytes: 2343434
    num_examples: 997
  - name: llama2_7B_unconstrained
    num_bytes: 2384462
    num_examples: 1000
  - name: llama2_13B_unconstrained
    num_bytes: 2406316
    num_examples: 1000
  - name: llama_33B_unconstrained
    num_bytes: 2411478
    num_examples: 1000
  - name: llama2_70B_unconstrained
    num_bytes: 2410426
    num_examples: 1000
  download_size: 3967063
  dataset_size: 18724198
configs:
- config_name: default
  data_files:
  - split: GPT4_unconstrained
    path: data/GPT4_unconstrained-*
  - split: GPT3.5_unconstrained
    path: data/GPT3.5_unconstrained-*
  - split: Claude_unconstrained
    path: data/Claude_unconstrained-*
  - split: Claude_Instant_unconstrained
    path: data/Claude_Instant_unconstrained-*
  - split: llama2_7B_unconstrained
    path: data/llama2_7B_unconstrained-*
  - split: llama2_13B_unconstrained
    path: data/llama2_13B_unconstrained-*
  - split: llama_33B_unconstrained
    path: data/llama_33B_unconstrained-*
  - split: llama2_70B_unconstrained
    path: data/llama2_70B_unconstrained-*
---
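The metadata above declares eight named splits under a single `default` configuration. A minimal sketch of reading one of them follows. It assumes the Hugging Face `datasets` library is installed, that the repository ID matches the one in the download link, and that network access is available. `load_split` is a hypothetical helper name, not part of the dataset or the library.

```python
# Split names copied from the `splits` section of the metadata.
SPLITS = [
    "GPT4_unconstrained",
    "GPT3.5_unconstrained",
    "Claude_unconstrained",
    "Claude_Instant_unconstrained",
    "llama2_7B_unconstrained",
    "llama2_13B_unconstrained",
    "llama_33B_unconstrained",
    "llama2_70B_unconstrained",
]

# Assumption: repository ID as it appears in the download link.
REPO_ID = "Saibo-creator/synthie-LLM-generation"


def load_split(split_name):
    """Load one split (hypothetical helper; requires network access)."""
    if split_name not in SPLITS:
        raise ValueError(f"unknown split: {split_name!r}")
    # Imported lazily so this module can be imported without `datasets`.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split=split_name)
```

Calling `load_split("GPT4_unconstrained")` should return a dataset object whose columns are the five features listed in the metadata, provided the assumptions above hold.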
Dataset information:
Features:
- Field: id, data type: 64-bit integer (int64)
- Field: input, data type: string
- Field: output, data type: string
- Field: prompt, data type: string
- Field: label, data type: string
Splits:
- Split: GPT4_unconstrained, size in bytes: 2255169, examples: 1000
- Split: GPT3.5_unconstrained, size in bytes: 2241194, examples: 1000
- Split: Claude_unconstrained, size in bytes: 2271719, examples: 997
- Split: Claude_Instant_unconstrained, size in bytes: 2343434, examples: 997
- Split: llama2_7B_unconstrained, size in bytes: 2384462, examples: 1000
- Split: llama2_13B_unconstrained, size in bytes: 2406316, examples: 1000
- Split: llama_33B_unconstrained, size in bytes: 2411478, examples: 1000
- Split: llama2_70B_unconstrained, size in bytes: 2410426, examples: 1000
Download size: 3967063 bytes; total dataset size: 18724198 bytes
Configurations:
- Configuration name: default, data files:
  - Split: GPT4_unconstrained, path: data/GPT4_unconstrained-*
  - Split: GPT3.5_unconstrained, path: data/GPT3.5_unconstrained-*
  - Split: Claude_unconstrained, path: data/Claude_unconstrained-*
  - Split: Claude_Instant_unconstrained, path: data/Claude_Instant_unconstrained-*
  - Split: llama2_7B_unconstrained, path: data/llama2_7B_unconstrained-*
  - Split: llama2_13B_unconstrained, path: data/llama2_13B_unconstrained-*
  - Split: llama_33B_unconstrained, path: data/llama_33B_unconstrained-*
  - Split: llama2_70B_unconstrained, path: data/llama2_70B_unconstrained-*
Provider:
Saibo-creator
Original information summary
Dataset overview
Dataset features
- id: integer (int64)
- input: string
- output: string
- prompt: string
- label: string
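The five features describe a flat record: one integer identifier and four string fields. A minimal sketch of checking a record against that schema follows. `SCHEMA` and `validate_record` are hypothetical names introduced for illustration, and the example values are placeholders, not data from the dataset.

```python
# Field names and Python types corresponding to the listed dtypes.
SCHEMA = {
    "id": int,      # int64 in the metadata
    "input": str,
    "output": str,
    "prompt": str,
    "label": str,
}


def validate_record(record):
    """Return a list of problems; an empty list means the record conforms."""
    problems = []
    for field, expected in SCHEMA.items():
        if field not in record:
            problems.append(f"missing field: {field}")
            continue
        value = record[field]
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, expected):
            problems.append(
                f"{field}: expected {expected.__name__}, got {type(value).__name__}"
            )
    for field in record:
        if field not in SCHEMA:
            problems.append(f"unexpected field: {field}")
    return problems


# Placeholder values for illustration only; not taken from the dataset.
example = {"id": 0, "input": "...", "output": "...", "prompt": "...", "label": "..."}
print(validate_record(example))  # prints []
```

Returning a list of problems rather than raising on the first one makes it easier to report every mismatch in a record at once.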
Dataset splits
- GPT4_unconstrained: 1000 examples, 2255169 bytes
- GPT3.5_unconstrained: 1000 examples, 2241194 bytes
- Claude_unconstrained: 997 examples, 2271719 bytes
- Claude_Instant_unconstrained: 997 examples, 2343434 bytes
- llama2_7B_unconstrained: 1000 examples, 2384462 bytes
- llama2_13B_unconstrained: 1000 examples, 2406316 bytes
- llama_33B_unconstrained: 1000 examples, 2411478 bytes
- llama2_70B_unconstrained: 1000 examples, 2410426 bytes
Dataset size
- Download size: 3967063 bytes
- Total dataset size: 18724198 bytes
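The per-split figures can be cross-checked against the reported totals with a few lines of arithmetic. The sketch below uses only numbers copied from the metadata. Reading the gap between download size and dataset size as storage compression is an assumption based on how the Hugging Face `datasets` library usually reports these two fields, not something the page states.

```python
# (num_bytes, num_examples) per split, copied from the metadata.
SPLIT_STATS = {
    "GPT4_unconstrained": (2255169, 1000),
    "GPT3.5_unconstrained": (2241194, 1000),
    "Claude_unconstrained": (2271719, 997),
    "Claude_Instant_unconstrained": (2343434, 997),
    "llama2_7B_unconstrained": (2384462, 1000),
    "llama2_13B_unconstrained": (2406316, 1000),
    "llama_33B_unconstrained": (2411478, 1000),
    "llama2_70B_unconstrained": (2410426, 1000),
}
DATASET_SIZE = 18724198   # reported total dataset size in bytes
DOWNLOAD_SIZE = 3967063   # reported download size in bytes

total_bytes = sum(num_bytes for num_bytes, _ in SPLIT_STATS.values())
total_examples = sum(num_examples for _, num_examples in SPLIT_STATS.values())
size_ratio = DATASET_SIZE / DOWNLOAD_SIZE

print(total_bytes == DATASET_SIZE)  # prints True
print(total_examples)               # prints 7994
print(round(size_ratio, 2))         # prints 4.72
```

The split sizes sum exactly to the reported dataset size, and the eight splits together hold 7994 examples because the two Claude splits each have 997 rather than 1000.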
Configuration
- config_name: default
- data_files:
  - split: name of the corresponding data split
  - path: path to the data files, in the form data/{split name}-*
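Each split's `path` is a glob pattern built from the split name. A minimal sketch of how such a pattern selects files follows, using Python's standard `fnmatch` module. The file names in `candidate_files` are hypothetical, since the page does not list the actual files, and `pattern_for` and `files_for` are illustrative helper names.

```python
from fnmatch import fnmatchcase


def pattern_for(split_name):
    """Build the glob pattern for a split, following data/{split name}-*."""
    return f"data/{split_name}-*"


def files_for(split_name, candidates):
    """Return the candidate paths matched by the split's glob pattern."""
    pattern = pattern_for(split_name)
    return [path for path in candidates if fnmatchcase(path, pattern)]


# Hypothetical file names for illustration; the real names are not given.
candidate_files = [
    "data/GPT4_unconstrained-00000-of-00001.parquet",
    "data/Claude_unconstrained-00000-of-00001.parquet",
    "data/Claude_Instant_unconstrained-00000-of-00001.parquet",
]

print(pattern_for("GPT4_unconstrained"))  # prints data/GPT4_unconstrained-*
print(files_for("Claude_unconstrained", candidate_files))
```

Because the pattern includes the hyphen after the split name, `Claude_unconstrained` does not also pick up files belonging to `Claude_Instant_unconstrained`.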



