sumuks/purple_wintermute_0.2_training_data_in_progress
收藏Hugging Face2025-01-19 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/sumuks/purple_wintermute_0.2_training_data_in_progress
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含两个配置,每个配置都包含用于训练的文本数据。数据集中的特征包括输入ID、注意力掩码、标签、位置ID和序列长度。数据集被划分为训练集,每个训练集包含大约761,163个样本,数据大小约为6026MB。
The dataset consists of two configurations, each containing text data for training. The features in the dataset include input IDs, attention masks, labels, position IDs, and sequence lengths. The dataset is split into training sets, each containing approximately 761,163 samples, with a data size of about 6026MB.
提供机构:
sumuks



