amuvarma/interleaved_25k
Source: Hugging Face. Updated 2024-11-24; indexed 2024-12-14.
Download link:
https://hf-mirror.com/datasets/amuvarma/interleaved_25k
Description:
The dataset has three fields: input_ids (a sequence of integers), attention_mask (a sequence of bytes), and labels (a sequence of long integers). It has a single split, train, containing 25674 examples and occupying 1708526392 bytes (about 1.7 GB). Only the default configuration is defined, and the data files for the train split have names beginning with train-.
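As a rough illustration of how the description above might be used, the sketch below loads the train split with the Hugging Face `datasets` library and checks a record against the three listed fields. It assumes the repository ID from the title resolves on the Hub and that `datasets` is installed. The helper names (`load_train_split`, `missing_fields`, `average_bytes_per_example`) are hypothetical and not part of the dataset or the library.

```python
from typing import Any, List, Mapping

# Values copied from the listing above.
REPO_ID = "amuvarma/interleaved_25k"
NUM_TRAIN_EXAMPLES = 25674
TRAIN_SIZE_BYTES = 1708526392
EXPECTED_FIELDS = ("input_ids", "attention_mask", "labels")


def load_train_split(streaming: bool = True):
    """Load the train split (needs `pip install datasets` and network access).

    Streaming is the default here only to avoid downloading ~1.7 GB up front;
    that choice is an assumption of this sketch, not something the listing says.
    """
    from datasets import load_dataset  # lazy import: helpers below work offline

    return load_dataset(REPO_ID, split="train", streaming=streaming)


def missing_fields(example: Mapping[str, Any]) -> List[str]:
    """Return the listed field names that are absent from one record."""
    return [name for name in EXPECTED_FIELDS if name not in example]


def average_bytes_per_example() -> float:
    """Average size per example implied by the listed totals."""
    return TRAIN_SIZE_BYTES / NUM_TRAIN_EXAMPLES
```

With network access, `missing_fields(next(iter(load_train_split())))` should return an empty list if the hosted data matches the description. The listed totals imply an average of roughly 66.5 KB per example.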
Provider:
amuvarma



