Taki135/OpenOrca_more_than_100_tokens
Source: Hugging Face. Updated: 2024-03-01. Indexed: 2024-03-04.
Download link:
https://hf-mirror.com/datasets/Taki135/OpenOrca_more_than_100_tokens
Resource description:
---
language:
- en
dataset_info:
  features:
  - name: id
    dtype: string
  - name: system_prompt
    dtype: string
  - name: question
    dtype: string
  - name: response
    dtype: string
  splits:
  - name: train
    num_bytes: 2927171424.10108
    num_examples: 1716231
  download_size: 2007021496
  dataset_size: 2927171424.10108
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---
This dataset has four features: id, system_prompt, question, and response, all of type string. It contains a single training split (train) with 1716231 examples, reported as 2927171424.10108 bytes (roughly 2.93 GB). The download size is 2007021496 bytes (roughly 2.01 GB). The only configuration is named default, and its training data files match the path data/train-*.
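As a rough illustration of the schema described above, the following sketch streams a few training examples with the Hugging Face `datasets` library and checks that each record carries the four string features. The helper names `has_expected_schema` and `peek` are hypothetical, and the sketch assumes the dataset is still published under the identifier shown in the title.

```python
from itertools import islice

# Dataset identifier as given in the page title.
DATASET_ID = "Taki135/OpenOrca_more_than_100_tokens"
# Feature names as listed in the dataset card metadata.
EXPECTED_FEATURES = ("id", "system_prompt", "question", "response")


def has_expected_schema(record: dict) -> bool:
    """Return True if the record has exactly the four features, all strings."""
    return set(record) == set(EXPECTED_FEATURES) and all(
        isinstance(record[name], str) for name in EXPECTED_FEATURES
    )


def peek(n: int = 3) -> list:
    """Stream the first n training examples instead of downloading all shards."""
    # Imported lazily so the schema check above works without `datasets` installed.
    from datasets import load_dataset

    stream = load_dataset(DATASET_ID, split="train", streaming=True)
    return list(islice(stream, n))
```

Calling `peek()` requires network access and an installed `datasets` package; streaming is used here only because the full download is about 2 GB.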
Provided by:
Taki135
Summary of original information
Dataset overview
Dataset information
- Features:
  - id: string
  - system_prompt: string
  - question: string
  - response: string
Splits
- train:
  - Bytes: 2927171424.10108
  - Examples: 1716231
Dataset size
- Download size: 2007021496 bytes
- Dataset size: 2927171424.10108 bytes
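To put the size figures in perspective, the short sketch below derives a few quantities from the metadata values: sizes in decimal gigabytes, the average bytes per example, and the ratio of download size to dataset size. The variable and function names are illustrative only; the three constants are copied from the metadata.

```python
# Values copied from the dataset card metadata.
NUM_BYTES = 2927171424.10108   # dataset_size / num_bytes of the train split
NUM_EXAMPLES = 1716231         # num_examples of the train split
DOWNLOAD_SIZE = 2007021496     # download_size


def to_gb(num_bytes: float) -> float:
    """Convert bytes to decimal gigabytes (1 GB = 10**9 bytes)."""
    return num_bytes / 1e9


# Average size of one example in bytes.
avg_bytes_per_example = NUM_BYTES / NUM_EXAMPLES
# Fraction of the dataset size that has to be downloaded.
download_ratio = DOWNLOAD_SIZE / NUM_BYTES

print(f"dataset size:  {to_gb(NUM_BYTES):.2f} GB")
print(f"download size: {to_gb(DOWNLOAD_SIZE):.2f} GB")
print(f"average bytes per example: {avg_bytes_per_example:.1f}")
print(f"download / dataset ratio:  {download_ratio:.3f}")
```

The fractional byte count is consistent with a size estimate scaled down from a larger source dataset, although the page does not say how the figure was computed.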
Configs
- default:
  - Data files:
    - Split: train
    - Path: data/train-*
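The path `data/train-*` is a glob pattern that selects the files belonging to the train split. The sketch below illustrates the idea using Python's `fnmatch` as a stand-in for whatever pattern resolution the hosting platform actually performs. The file names in `candidate_files` are hypothetical and do not come from the repository listing.

```python
from fnmatch import fnmatch

# Glob pattern taken from the default config's data_files entry.
PATTERN = "data/train-*"

# Hypothetical repository file names, for illustration only.
candidate_files = [
    "data/train-00000-of-00006.parquet",
    "data/test-00000-of-00001.parquet",
    "README.md",
]


def select_split_files(files, pattern=PATTERN):
    """Return the files whose path matches the glob pattern."""
    return [path for path in files if fnmatch(path, pattern)]


train_files = select_split_files(candidate_files)
print(train_files)
```

Only paths that start with `data/train-` are selected, so every matching shard is treated as part of the single train split.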



