SKNahin/dolmino-mix-wiki
Source: Hugging Face. Updated: 2025-10-15. Indexed: 2025-10-25.
Download link:
https://hf-mirror.com/datasets/SKNahin/dolmino-mix-wiki
Resource overview:
Records in this dataset include fields such as the time added, the creation time, a unique identifier, the source, and the text content, along with metadata covering length, provenance, revision ID, and URL. The dataset provides a training split that is 17,428,311,937 bytes in size and contains 6,171,220 samples. The default configuration specifies the path to the training-split data files.
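As a rough illustration of the figures above, the following sketch derives the average size per sample and shows one way the training split might be inspected. The repository ID comes from the listing title. The helper names `mean_bytes_per_sample` and `peek` are hypothetical, and the use of the Hugging Face `datasets` library in streaming mode is an assumption about how a reader would access the data, not something the listing prescribes.

```python
from itertools import islice

REPO_ID = "SKNahin/dolmino-mix-wiki"  # repository ID taken from the listing title
TRAIN_BYTES = 17_428_311_937          # training split size reported in the listing
TRAIN_SAMPLES = 6_171_220             # training sample count reported in the listing


def mean_bytes_per_sample(total_bytes: int, num_samples: int) -> float:
    """Average size of one sample in bytes."""
    return total_bytes / num_samples


def peek(n: int = 3) -> list:
    """Return the first n training records without downloading the whole split.

    Assumption: the `datasets` library is installed and the repository is
    reachable. The import is local so the arithmetic above works without it.
    """
    from datasets import load_dataset

    stream = load_dataset(REPO_ID, split="train", streaming=True)
    return list(islice(stream, n))


if __name__ == "__main__":
    avg = mean_bytes_per_sample(TRAIN_BYTES, TRAIN_SAMPLES)
    print(f"average sample size: {avg:.0f} bytes")
```

The average works out to a little under 3 KB per sample. Calling `peek()` needs network access; printing `record.keys()` for one of the returned records is a simple way to check the actual field names, which the listing describes only loosely.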
Provider:
SKNahin



