kilian-group/LMLM-pretrain-dwiki6.1M_v2
收藏Hugging Face2025-09-14 更新2025-10-18 收录
下载链接:
https://hf-mirror.com/datasets/kilian-group/LMLM-pretrain-dwiki6.1M_v2
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个字段,如标注文本、原始文本、模型ID、提示ID等。它被划分为了训练集,大小为35927999876字节,共有6245642个示例。数据集的具体内容和用途在README中未提及,因此无法给出详细的中文描述。
The dataset contains multiple fields such as annotated text, original text, model ID, prompt ID, etc. It is split into a training set, which is 35927999876 bytes in size and contains 6245642 examples. The specific content and purpose of the dataset are not mentioned in the README, so no detailed English description can be provided.
提供机构:
kilian-group



