llama-duo/synth_summarize_dataset
收藏Hugging Face2024-05-31 更新2024-05-25 收录
下载链接:
https://hf-mirror.com/datasets/llama-duo/synth_summarize_dataset
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: prompt
dtype: string
- name: prompt_id
dtype: string
- name: messages
list:
- name: content
dtype: string
- name: role
dtype: string
- name: category
dtype: string
- name: generator
dtype: string
- name: seed_prompt
dtype: string
splits:
- name: test
num_bytes: 89079
num_examples: 25
- name: train_sft_gpt4o
num_bytes: 1182529290
num_examples: 300838
- name: train_sft_gemini1_5flash
num_bytes: 1171342037
num_examples: 301401
- name: train_sft_claude3sonnet
num_bytes: 1301414410
num_examples: 301005
download_size: 581879012
dataset_size: 3655374816
configs:
- config_name: default
data_files:
- split: test
path: data/test-*
- split: train_sft_gpt4o
path: data/train_sft_gpt4o-*
- split: train_sft_gemini1_5flash
path: data/train_sft_gemini1_5flash-*
- split: train_sft_claude3sonnet
path: data/train_sft_claude3sonnet-*
---
The dataset includes multiple features such as prompt, prompt_id, messages (containing content and role), category, generator, and seed_prompt. It is divided into several parts including test, train_sft_gpt4o, train_sft_gemini1_5flash, and train_sft_claude3sonnet, each with its corresponding byte count and number of examples. Additionally, the download size and total size of the dataset are provided.
提供机构:
llama-duo
原始信息汇总
数据集信息
特征
- prompt: 类型为字符串
- prompt_id: 类型为字符串
- messages: 列表类型,包含以下字段:
- content: 类型为字符串
- role: 类型为字符串
- category: 类型为字符串
- generator: 类型为字符串
- seed_prompt: 类型为字符串
数据分割
- test: 字节数为89079,样本数为25
- train_sft_gpt4o: 字节数为1182529290,样本数为300838
- train_sft_gemini1_5flash: 字节数为1171342037,样本数为301401
- train_sft_claude3sonnet: 字节数为1301414410,样本数为301005
数据大小
- 下载大小: 581879012字节
- 数据集大小: 3655374816字节
配置
- default: 包含以下数据文件路径:
- test:
data/test-* - train_sft_gpt4o:
data/train_sft_gpt4o-* - train_sft_gemini1_5flash:
data/train_sft_gemini1_5flash-* - train_sft_claude3sonnet:
data/train_sft_claude3sonnet-*
- test:



