TwinDoc/dataset-sft-gpt-redwhale-1m
收藏Hugging Face2024-08-02 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/TwinDoc/dataset-sft-gpt-redwhale-1m
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含一个名为messages的特征,该特征是一个列表,列表中包含content和role两个字段,数据类型均为字符串。数据集分为一个训练集(train),包含1,122,566个样本,总大小为1,034,702,159字节。下载大小为548,944,774字节,数据集总大小与训练集大小相同。
The dataset contains a feature named messages, which is a list containing two fields: content and role, both of which are of string data type. The dataset is divided into a training set (train) with 1,122,566 samples and a total size of 1,034,702,159 bytes. The download size is 548,944,774 bytes, and the total dataset size is the same as the training set size.
提供机构:
TwinDoc



