NobodyExistsOnTheInternet/dedupedsharegptsilly
收藏Hugging Face2024-01-01 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/NobodyExistsOnTheInternet/dedupedsharegptsilly
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: model
dtype: string
- name: conversation
list:
- name: from
dtype: string
- name: value
dtype: string
- name: chat_string
dtype: string
splits:
- name: train
num_bytes: 169980583
num_examples: 6847
download_size: 92511871
dataset_size: 169980583
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
The dataset includes three main features: model, conversation, and chat_string. model is a string type feature, conversation is a list containing two sub-features: from and value, both of which are also string types. chat_string is also a string type feature. The dataset is divided into a training set with 6847 samples, with a total size of 169980583 bytes. The download size of the dataset is 92511871 bytes. The dataset configuration is default, and the training data files are located at data/train-* path.
提供机构:
NobodyExistsOnTheInternet
原始信息汇总
数据集概述
数据集特征
- model: 数据类型为字符串。
- conversation: 包含以下子特征:
- from: 数据类型为字符串。
- value: 数据类型为字符串。
- chat_string: 数据类型为字符串。
数据集划分
- train: 包含6847个样本,占用169980583字节。
数据集大小
- 下载大小: 92511871字节。
- 实际大小: 169980583字节。
配置
- default: 包含训练数据文件,路径为
data/train-*。



