Lucidest/7.21-reddit-classify

Name: Lucidest/7.21-reddit-classify
Creator: Lucidest
Published: 2024-07-21 15:38:11
License: 暂无描述

Hugging Face2024-07-21 更新2024-07-22 收录

下载链接：

https://hf-mirror.com/datasets/Lucidest/7.21-reddit-classify

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含三个特征：prompt（提示）、completion（完成）和split（分割），均为字符串类型。数据集分为一个训练集（train），包含232073个样本，总字节数为208232069。数据集的下载大小为104821096字节，数据集大小为208232069字节。配置文件中有一个默认配置（default），其数据文件路径为data/train-*。

The dataset includes three main features: prompt, completion, and split, all of which are string type. The dataset is divided into a training set (train) with 232073 samples, totaling 208232069 bytes. The download size of the dataset is 104821096 bytes, and the dataset size is 208232069 bytes. There is a default configuration in the config file, with the data file path being data/train-*.

提供机构：

Lucidest

原始信息汇总

数据集概述

数据集信息

特征:
- prompt: 数据类型为字符串。
- completion: 数据类型为字符串。
- split: 数据类型为字符串。

数据分割

train:
- 字节数: 208232069
- 样本数: 232073

数据集大小

下载大小: 104821096
数据集大小: 208232069

配置

配置名称: default
- 数据文件:
  - 分割: train
  - 路径: data/train-*

5,000+

优质数据集

54 个

任务类型

进入经典数据集