timaeus/wikipedia_en_max_loss_delta_ablation_l0h6
Source: Hugging Face. Updated: 2025-03-18. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/timaeus/wikipedia_en_max_loss_delta_ablation_l0h6
Resource description:
The dataset contains text samples with accompanying metadata. Each sample's text is stored as a string, and its metadata includes a field named pile_set_name that identifies the set the sample belongs to. The dataset has a single training split of 10,000 samples; it occupies 37,595,739 bytes, and the download is 21,488,634 bytes.
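As a rough illustration of the structure described above, the following Python sketch derives two figures from the quoted sizes and counts samples per pile_set_name. The column names "text" and "meta", the nesting of pile_set_name under "meta", and the helper names size_summary, count_sets, and load_train_split are assumptions made for this example and are not confirmed by the listing. The loader relies on the Hugging Face `datasets` library and needs network access, so it is kept separate from the offline helpers.

```python
from collections import Counter

# Figures quoted in the resource description.
NUM_SAMPLES = 10_000
DATASET_BYTES = 37_595_739
DOWNLOAD_BYTES = 21_488_634


def size_summary():
    """Derive average sample size and download-to-dataset ratio from the quoted sizes."""
    return {
        "avg_bytes_per_sample": DATASET_BYTES / NUM_SAMPLES,
        "download_ratio": DOWNLOAD_BYTES / DATASET_BYTES,
    }


def count_sets(records):
    """Count samples per pile_set_name.

    Assumes each record looks like
    {"text": str, "meta": {"pile_set_name": str}};
    the column names "text" and "meta" are assumptions.
    """
    return Counter(r["meta"]["pile_set_name"] for r in records)


def load_train_split():
    """Load the train split from the Hugging Face Hub (requires network access)."""
    # Imported lazily so the helpers above work without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(
        "timaeus/wikipedia_en_max_loss_delta_ablation_l0h6", split="train"
    )


if __name__ == "__main__":
    print(size_summary())
```

If the assumed schema holds, `count_sets(load_train_split())` would show how the 10,000 samples are distributed across set names.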
Provider:
timaeus



