timaeus/pile-uspto_backgrounds
收藏Hugging Face2025-04-03 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/timaeus/pile-uspto_backgrounds
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含文本和元数据两个主要特征。文本特征为字符串类型,元数据特征包含一个名为pile_set_name的字符串类型字段。数据集包含一个训练分割,共有100,000个样本,总大小为428,010,825字节。数据集的下载大小为203,458,463字节,总数据集大小与训练分割的大小相同。数据集的配置文件名为default,数据文件路径为data/train-*。
This dataset contains two main features: text and meta. The text feature is of string type, and the meta feature is a structure containing a string-type field named pile_set_name. The dataset includes a train split with 100,000 samples and a total size of 428,010,825 bytes. The download size of the dataset is 203,458,463 bytes, and the total dataset size is the same as the size of the train split. The configuration file for the dataset is named default, and the data file path is data/train-*.
提供机构:
timaeus



