genloop/bloomberg-news-articles-pretraining-dataset
Source: Hugging Face. Updated 2024-11-08. Indexed 2024-12-14.
Download link:
https://hf-mirror.com/datasets/genloop/bloomberg-news-articles-pretraining-dataset
Resource description:
The dataset has a single feature, text, of type string. It consists of one training split with 436,869 examples totaling 1,234,232,423 bytes; the download size is 725,675,795 bytes. In the default configuration, data files are matched by the path pattern data/train-*.
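As a rough illustration of what these figures imply, the sketch below derives the average example size and the download-to-total size ratio from the numbers quoted above, and shows one way the split might be previewed. It assumes the repository is reachable through the Hugging Face `datasets` library under the ID that appears in the download link, and that access is not gated; the helper names (`average_bytes_per_example`, `download_ratio`, `preview`) are hypothetical and are not part of the dataset or of any library.

```python
from itertools import islice

# Figures quoted in the resource description.
NUM_TRAIN_EXAMPLES = 436_869
DATASET_SIZE_BYTES = 1_234_232_423
DOWNLOAD_SIZE_BYTES = 725_675_795

# Repository ID taken from the download link (assumed to resolve on the Hub).
REPO_ID = "genloop/bloomberg-news-articles-pretraining-dataset"


def average_bytes_per_example() -> float:
    """Mean size of one training example, from the stated totals."""
    return DATASET_SIZE_BYTES / NUM_TRAIN_EXAMPLES


def download_ratio() -> float:
    """Download size as a fraction of the stated total size."""
    return DOWNLOAD_SIZE_BYTES / DATASET_SIZE_BYTES


def preview(n: int = 3) -> list[str]:
    """Stream the first n `text` values without fetching the whole split.

    Needs network access and the third-party `datasets` package.
    """
    from datasets import load_dataset

    stream = load_dataset(REPO_ID, split="train", streaming=True)
    return [row["text"] for row in islice(stream, n)]


if __name__ == "__main__":
    print(f"about {average_bytes_per_example():.0f} bytes per example")
    print(f"download is about {download_ratio():.0%} of the total size")
```

Streaming is used in `preview` so that a quick inspection does not require downloading all 725,675,795 bytes first; for full pretraining runs a regular, non-streaming load may be more convenient.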
Provider:
genloop



