bikingSolo/CoolDatasetForLLM_PracLLM
Source: Hugging Face. Updated: 2024-12-25. Indexed: 2025-02-15.
Download link:
https://hf-mirror.com/datasets/bikingSolo/CoolDatasetForLLM_PracLLM
Description:
The dataset has two fields: a text field of string type and a metadata field containing the source and URL of each example. It is split into a training set of 1,344,656 examples (7,195,322,812.69 bytes) and a test set of 336,164 examples (1,798,830,703.17 bytes), an 80/20 split of 1,680,820 examples in total. The entire dataset is 8,994,153,515.86 bytes in size.
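The figures in the description can be sanity-checked with a few lines of Python. The sketch below is only an illustration: the `Split` class and the `summarize` and `is_valid_record` helpers are hypothetical, and the field names `text`, `meta`, `source`, and `url` are assumptions based on the description, not confirmed names from the dataset itself.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Split:
    name: str
    num_examples: int
    num_bytes: float  # the listing reports fractional byte counts


# Figures copied from the description.
TRAIN = Split("train", 1_344_656, 7_195_322_812.69)
TEST = Split("test", 336_164, 1_798_830_703.17)


def summarize(splits):
    """Return totals, per-split example fractions, and average bytes per example."""
    total_examples = sum(s.num_examples for s in splits)
    total_bytes = sum(s.num_bytes for s in splits)
    return {
        "total_examples": total_examples,
        "total_bytes": total_bytes,
        "fractions": {s.name: s.num_examples / total_examples for s in splits},
        "avg_bytes": {s.name: s.num_bytes / s.num_examples for s in splits},
    }


def is_valid_record(record):
    """Check a record against the ASSUMED shape:
    {"text": str, "meta": {"source": str, "url": str}}.
    The key names are guesses; adjust them to the real schema.
    """
    meta = record.get("meta")
    return (
        isinstance(record.get("text"), str)
        and isinstance(meta, dict)
        and isinstance(meta.get("source"), str)
        and isinstance(meta.get("url"), str)
    )


if __name__ == "__main__":
    summary = summarize([TRAIN, TEST])
    print(summary["total_examples"], round(summary["total_bytes"], 2))
    for name, fraction in summary["fractions"].items():
        print(name, f"{fraction:.2%}", f"{summary['avg_bytes'][name]:.1f} bytes/example")
```

Run as a script, this prints the combined example count, the total size, and each split's share and average record size. Both splits come out at roughly 5.35 kB per example, which would be consistent with a random 80/20 partition of a single corpus, although the listing does not say how the split was made.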
Provider:
bikingSolo



