flydust/ShareGPT-Vicuna-unfiltered-axolotl
收藏Hugging Face2024-05-29 更新2024-07-22 收录
下载链接:
https://hf-mirror.com/datasets/flydust/ShareGPT-Vicuna-unfiltered-axolotl
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: dataset
dtype: string
- name: id
dtype: string
- name: conversations
list:
- name: from
dtype: string
- name: value
dtype: string
splits:
- name: train
num_bytes: 422521695.52684927
num_examples: 111912
download_size: 361222437
dataset_size: 422521695.52684927
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
Convert anon8231489123/ShareGPT_Vicuna_unfiltered to sharegpt format for Axolotl SFT.
The dataset includes three main features: dataset, id, and conversations. The conversations is a list containing from and value sub-features. The dataset is divided into a training set with 111912 samples, totaling 422521695.52684927 bytes. This dataset is used to convert anon8231489123/ShareGPT_Vicuna_unfiltered to sharegpt format for use in Axolotl SFT.
提供机构:
flydust
原始信息汇总
数据集概述
数据集信息
- 特征:
- dataset: 数据集名称,类型为字符串。
- id: 数据集ID,类型为字符串。
- conversations: 对话列表,包含以下字段:
- from: 对话来源,类型为字符串。
- value: 对话内容,类型为字符串。
数据集分割
- train:
- num_bytes: 422521695.52684927 字节
- num_examples: 111912 个样本
数据集大小
- download_size: 361222437 字节
- dataset_size: 422521695.52684927 字节
配置
- config_name: default
- data_files:
- split: train
- path: data/train-*
- data_files:



