Rijgersberg/common_corpus_nl_3
Source: Hugging Face. Updated: 2025-02-14. Indexed: 2025-02-15.
Download link:
https://hf-mirror.com/datasets/Rijgersberg/common_corpus_nl_3
Description:
This is a text dataset in which each record carries the following features: text identifier, collection, open type, license, creation date, title, creator, language, language type, word count, token count, and the text content itself. It is provided as a single training split of 317,499 examples, about 4.7 GB in total. Only a default configuration is defined, and the data can be loaded by specifying the dataset path.
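Loading by path can be sketched as follows. This is a minimal example, not an official recipe: it assumes the Hugging Face `datasets` package is installed and uses streaming so the full ~4.7 GB is not downloaded up front. The column name `word_count` is an assumption inferred from the feature list above, and `stream_examples` and `total_words` are hypothetical helper names; inspect a real record's keys before relying on them.

```python
from typing import Iterable, Iterator, Mapping

# Repository ID taken from the listing above.
DATASET_ID = "Rijgersberg/common_corpus_nl_3"


def stream_examples(limit: int = 3) -> Iterator[Mapping]:
    """Yield the first `limit` records of the train split via streaming."""
    # Imported lazily so total_words() below works without the
    # third-party `datasets` package installed.
    from datasets import load_dataset

    ds = load_dataset(DATASET_ID, split="train", streaming=True)
    yield from ds.take(limit)


def total_words(examples: Iterable[Mapping], key: str = "word_count") -> int:
    """Sum a per-record word count.

    `key` is an ASSUMED column name; missing or null values count as 0.
    """
    return sum(int(ex.get(key) or 0) for ex in examples)
```

Calling `list(stream_examples(3))` needs network access and returns three records as dictionaries; printing `sorted(record.keys())` on one of them shows the actual column names, which can then be passed to `total_words` through `key`.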
Provider:
Rijgersberg



