Hieuman/gmane_dataset
收藏Hugging Face2025-03-04 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/Hieuman/gmane_dataset
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了作者的标识符(authorIDs)和完整的文本内容(fullText)。它被划分为训练集,共有1917151个样本,数据集大小为7705285092字节。
The dataset includes author identifiers (authorIDs) and full text content (fullText). It is split into a training set with a total of 1,917,151 examples, and the dataset size is 7,705,285,092 bytes.
提供机构:
Hieuman



