five

ZorraZabb/code25wiki75_sampling_xml_fitered

收藏
Hugging Face2024-09-27 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/ZorraZabb/code25wiki75_sampling_xml_fitered
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含文本数据,主要分为训练集和测试集两部分。训练集包含999,000个样本,总大小为4,585,763,537.508584字节;测试集包含1,000个样本,总大小为4,590,353.891399983字节。整个数据集的下载大小为2,190,765,309字节,总数据集大小为4,590,353,891.399984字节。数据集的配置文件中指定了训练集和测试集的数据文件路径。

This dataset contains text data, primarily divided into a training set and a test set. The training set consists of 999,000 samples with a total size of 4,585,763,537.508584 bytes; the test set consists of 1,000 samples with a total size of 4,590,353.891399983 bytes. The entire dataset has a download size of 2,190,765,309 bytes and a total dataset size of 4,590,353,891.399984 bytes. The configuration file of the dataset specifies the data file paths for the training and test sets.
提供机构:
ZorraZabb
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作