AI4BD/all-data-dep
收藏Hugging Face2025-03-05 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/AI4BD/all-data-dep
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含两个字段:id和text。text字段中包含文本数据,id字段可能用于标识每个文本。训练集包含大约13亿7千5百33万5千条示例,数据集总大小为约654TB。这是一个大规模的文本数据集,但没有具体的主题或内容描述。
The dataset includes two fields: id and text. The text field contains textual data, and the id field may be used to identify each text. The training set contains approximately 1,375,335,000 examples, with the total dataset size being about 654TB. This is a large-scale text dataset, but there is no specific description of the topic or content.
提供机构:
AI4BD



